Gene TM1040_2409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2409 
Symbol 
ID4076735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2548100 
End bp2549374 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content61% 
IMG OID638007731 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_614403 
Protein GI99082249 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.352526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCTA TCTATTCCAT TGCAGATTTC ATCGACATGC TGCGCCGCCG TGTGTCGCTG 
ATCGTTGTCG TCACATTTCT GGGCTGTGTG GTGTCGGTCT GGCTGGCGCT GCAGAAGCAG
CCGATCTATT CCAGCACCGA AGTGATCCAG ATTACCCGTC CAAAGATTGC CGGAGATCTG
GCGCGCTCGA CTGCGGAGGG GTCCTCTGCA CGACGTATCC AATTGATCGA ACAACAGCTG
ATGGCGCGCG GAACGATCCT AGAAATCGTG GATCAGCTTG ACCTCTTTGC GGACCGTCCC
GGGCTTCTGG ATTCCGAAAT TGTCCCCCTG ATGCGCAACT CTGTTTCTTT GATGGGGACC
GCCGCGGCCC GCGAAGGCGG GTCCGACGAT GGCGCGATCT CGGTGCTTAC GATCACCGCG
AATATGCCGA CCCGCGAGCA AGCGCAGGCC GTCGCGCGCG AATTTTCCAA GCGCACCATT
GCCCTCAGCC AGAACACCCG GCTCGCCGAG GCGCGCGAAA CCTTGCTCTT TTTCAATGAA
AAAGAAGCGG CATTGGTGCG TGACATCACG GCGCTTGAGG AAGAGATTGC CAACTTCAGG
CACGAGAACG CCGTGACCTT GCCCGGTGCA ATCGAGATGC GAGGCGCGGA AATTACGGCG
ATCAATGAAA GCCTGCTGGA ACTCGCGCGA CAAGAGATCG AATTGCGCAA AGGTGCTGAG
GAGGCCGAGG CAACACAACG TGAAGCCTAT GCGCGCCGGG TTCGCGAAGA GTTCGACGCC
CAGCTCGAAA GCTTGACAGC GCAGCGCCAG CTGCTTGTGG ATCGCCGCGC CGAACTGGAG
GCGTCTCTCG AGCTCACCCC GGACGTGGAT CGCCAATTGG CCAGCTATGA GCGGCGTCAA
CAGCAGATGC AGTCCGAGCT TGAAGTCATC ACCGCGCGTC GCGCCGAGGC CGAGGTCGGG
TTCCGACTGG AGACCGCCAG TCAAGGCGAG CGTCTGCGGG TGATCGAGCC CGCACCGCTG
CCGGATTATG CGATGGGCGG TGGCCGCAAG TCTTTGGCGA TCAAGGGCGC CCTTGCGAGC
TTTGTCTTGG GGGTTCTTGC GGCCTTTGCC CTAGACTTGC GCCATCCGGT TCTGCGGTCG
AGCGGGCAGA TGAAGCGGGA AACCGGCCTG TCCCCGGTGG TGACGATCCC GGTTCTGACG
ACGCGCAAAA AAGGTCTGTT TGCACGCCTC TTTGCGCTGG GCGGACGCGG CACGGAACGG
CACGCGCGTA GCTAG
 
Protein sequence
MASIYSIADF IDMLRRRVSL IVVVTFLGCV VSVWLALQKQ PIYSSTEVIQ ITRPKIAGDL 
ARSTAEGSSA RRIQLIEQQL MARGTILEIV DQLDLFADRP GLLDSEIVPL MRNSVSLMGT
AAAREGGSDD GAISVLTITA NMPTREQAQA VAREFSKRTI ALSQNTRLAE ARETLLFFNE
KEAALVRDIT ALEEEIANFR HENAVTLPGA IEMRGAEITA INESLLELAR QEIELRKGAE
EAEATQREAY ARRVREEFDA QLESLTAQRQ LLVDRRAELE ASLELTPDVD RQLASYERRQ
QQMQSELEVI TARRAEAEVG FRLETASQGE RLRVIEPAPL PDYAMGGGRK SLAIKGALAS
FVLGVLAAFA LDLRHPVLRS SGQMKRETGL SPVVTIPVLT TRKKGLFARL FALGGRGTER
HARS