Gene TM1040_3840 details


Gene Information

Locus tag: TM1040_3840
Symbol:
ID: 4074903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008042
Strand:
Start bp: 86070
End bp: 89351
Gene Length: 3282 bp
Protein Length: 1093 aa
Translation table: 11
GC content: 53%
IMG OID: 638004497
Product: hemolysin-type calcium-binding region
Protein accession: YP_611232
Protein GI: 99077973
COG category:
COG ID:
TIGRFAM ID:
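
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 86070
end_bp = 89351

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3282
print(protein_length)  # 1093
```

Under these assumptions the results agree with the reported Gene Length (3282 bp) and Protein Length (1093 aa).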


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0668693
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCCTTTT GGGAAGGGCA GCTATCTTCC GGCGCGGTGA GTGCCGCTGA TGCGATTGAT 
GCAATTATCA AGGGCGCGAC AACTGCGCCT GATTCAACAA TTCTTACCAA TAAAAATGCT
GTTGGCTTGG ATTTTGCGAC GGACGCAGGT AACACACCTG GCTTCACCTT CGATTTGAAT
GGTGCGTCTG GCAGTGCGGC CAAGAACGCT CTAGCTGGCG TAACCGACGA CGCTGCAACT
GTTACCGCTG CTCAAGCCGC GACAGATGCA TATCTTAGCG GTGTTGCCGG CGCTGGTGAT
ACACTCTCGC TCACCACGGG CGTGGACAAC ATTACTGGGA CTGTCAATCA AGACACGATT
AAGGCTGTTG TGGGCGCCGG CGCCACTCTC TCTGCTGCTG ACGTCGTTGA TGGCGGGGCT
GGCACCGACA CGCTTGAGCT AGTTGATGCG ACCGGTGGGC AATCCTTGGC AGCTGCGCTT
CTCAATGTAA GCAACGTAGA AGAGTTGTCT ATTCGGAGCG TGGGGCAAGC CAAGGCGGAT
ACATCCGGAA CGGCCTTCAC TAGTGTAAAT ATCACACAAG CAACGTCCGT TGATGCTGAT
GTGAACGCTG CAACAAACGT AGGCGTATCT GGCGTAACTG GCGCAATTGC GGTTGATGGC
GGTAAAGACA TCACCATCAC CGACGCTAAC GATGGGGCAA ATATCACAGT CGGTGGTGCG
ACTGCTGTTA CTGGCGCCGT TACGGTTACA GACACTAAAG TTGGCAACGC AGCGATCGAA
GTTAACGGTG GGACAACCGT TACGGTCAAT GCAACCGGCT CATCTGCTAA CAACATTAAA
GTCGGTGGTG CGACCACAAC CGATCAGCCC TCTGGTGCTG TTTCGATTAC ATCTGCGCAT
GCTGCAACAG CAGGTAAAGA TGTGACATCT AGCGCCATTA CAACTGTTGG CGGCTCTAGC
GTTACCATTA CCTCAACTGC TGACACGTCC AAAGCTGCTG CCGATACAAA AGGCGCTACT
TTGACCCAAG GTGATGTAAG CGTCACTGGT GGCGCATCAA CAACTGAAGT AACAGTCAAC
CAAGCTGCAA ACGTTTCTAA GAATGCCGCA GTAGTGGCTG TCGCTGCGAA AGCCTCGACT
CAAAAGCTGA CGTTCACAAA TGCTGCAGCA AATGATGTGA TCACTCTGAC ATTCGACACA
GGTGATACTC TCGTCTTTAC AGCCTCGAAA GCGCTGACCG CAACCGAAGT TGCAACAGCT
TTTGCGAACC TTGCAAAGAA TGCTACTGAA GGTTCTGCAC CGGTTGTCAA TGGTGTATAC
ACCAACGGCG GTACGATCGA TCAAGGTTGG ACCTCGGGTG CTGTAACAGA TGTTAACGCA
GATGATACTT CAGCATCTGT AACATTCAGT AACGAAGCTG CTGGCCCGAC CGCCCTGGTT
GTCGCGAACG GGGGTACTGG TACAGCTACA GCTGCGACAC TTGCGGCCGG AACAGCTGCG
ACGGCTGCTG AAACGGGGCG CCTAGGCGTT ATCGCTGGTA AAGTCACTAT CGATGGCAAC
ATCACAGGCG ATGACGTTCT GAAAACGGTT ACAATCGACG CTTATGAAGA TGGGTCAACT
GTCAAATCGG ACGCATTGGA AACTCTGACG CTGAAAAACG CTGACGAAGA TATCGTAGTG
ACGACGGCGT CTACAGGTGC CATTACCCTG AACTTGGATG CTGTTGCTGG TAACGATGCT
AAAGTCAGCC TTGATGGCAA TGCAGCGACT GTTACTGGTC TGACCATCAA CGCCACCGGC
ACAAAGTCTG ATGCGGAAGT GACTGCAGAT GCTGCAACCA CAGTCACTGT GAATGCAGCG
GTGGACATGG ACCTGACCGG TTCGTCTTTT GACAAAGCAA CCAAGATGGT TGTGACTGGT
GCAGGTAAAG TGACTCTGGA TGGTGATGAT GTTGCGTCTG TCTTGGCTGA AGTTGATGCC
TCGGGCGCGA CTGGTGATGT CGATGCTTCT GGTGTCGCGC TGACAGCTGC TGGTGTTTAC
ACCGGTGGTT CGGGTGCAGA TACGTTCACC GTGACAGCCG CCGCGACCAA AGCAAGCACA
GGTGGTGATG GCGACGACGT GATCACTGTA AGCACGCTCG GTACTGGTGG TTCGGTCGAC
GCGGGCGCAG GAACAGATAC GCTGGTGATG GCAGCTGATG ACGCAGAAAC TGCATCAGCA
ACTGCAGGTT TCAAAGCAAC TAACTTTGAG AAGTTGTCTA TTGGAGCCGT GGCTGACGCC
ACAGCTGGTG CCGACACTGA AACTGTCAAC CTCGCAAACC TCGCCTTTAC TGAGGTAACA
TCAGCAGGTG CTGTAGATGG TGACGAAGAC ATTCTCGCCC TCACTAATGC GGCTTCTGGC
CTCACGCTCA ACATAACAGG TGAAGGCTTG TTTACGGTTG GGGTGAAAGA CGCAGCTACC
AATAAAACAG ATGTTCTGAA TATCGCTTCG AACAGCGATG GTAACTTGGC GTTGGGTACT
GTTACTGCTG CAAACGTCGA AACAATCAAC ATCAACGCAG TTGATAAGGT TGTTGACACA
ACAGGAGCGA CCGATGCGTT CGGTGCGGCT ATTGGTGATG GGAAGGATGA TAACAACTCG
GTTCAAACCC TTACCGTTGA TGCCGGTGCT GCAACTACGG TCAATGTGTC GGGTTCGGCT
GACTTGGATC TTACCGTAAA TACTGCGACT GCTCTTACTG CGGTAAATGC TTCCACTGCA
ACGGGTGCCG TCACATATAC AGCAAATGAC GGCACGACTA CCGTCACTGG TGGTTCGGGT
AACGATAACC TGACTGCGGC CGGGGATAGC GATGTCTTGC TCGGTGGTGC TGGTAACGAC
AAACTGACCG CAGCGACACT GACCACGCTG ACGGGTGGTG AAGGTAACGA TACCTTCGTA
ATGAGTGGCG CAGTTGATGC AACGAAGTAT TCGACAATCA CAGACCTCTC GTCTGGCGAT
ACGATTGATA CAGATGCAAC GGCATTCAAT TCGTCGAAAG TTGTGCTTGC TGCGAACGCC
ACCTTTGCTC AATATCTGGA TGCTGCGATT GTGGCGACGG CAGCACAGGA CGACGCAGCT
TCTTGGTTCC AGTTTGGCGG GAATACCTTC ATTGTGAATG AAGGTACGGA CGCGTCTGTA
ACTTATAATG CTGCAGAAGA CGGTATCATC GGTATTACTG GTCTCGTTGA TCTGTCCGCT
GCGACGTTCA ACGCTACTAA TGGTACGTTT GATATCGCGT AA
 
Protein sequence
MAFWEGQLSS GAVSAADAID AIIKGATTAP DSTILTNKNA VGLDFATDAG NTPGFTFDLN 
GASGSAAKNA LAGVTDDAAT VTAAQAATDA YLSGVAGAGD TLSLTTGVDN ITGTVNQDTI
KAVVGAGATL SAADVVDGGA GTDTLELVDA TGGQSLAAAL LNVSNVEELS IRSVGQAKAD
TSGTAFTSVN ITQATSVDAD VNAATNVGVS GVTGAIAVDG GKDITITDAN DGANITVGGA
TAVTGAVTVT DTKVGNAAIE VNGGTTVTVN ATGSSANNIK VGGATTTDQP SGAVSITSAH
AATAGKDVTS SAITTVGGSS VTITSTADTS KAAADTKGAT LTQGDVSVTG GASTTEVTVN
QAANVSKNAA VVAVAAKAST QKLTFTNAAA NDVITLTFDT GDTLVFTASK ALTATEVATA
FANLAKNATE GSAPVVNGVY TNGGTIDQGW TSGAVTDVNA DDTSASVTFS NEAAGPTALV
VANGGTGTAT AATLAAGTAA TAAETGRLGV IAGKVTIDGN ITGDDVLKTV TIDAYEDGST
VKSDALETLT LKNADEDIVV TTASTGAITL NLDAVAGNDA KVSLDGNAAT VTGLTINATG
TKSDAEVTAD AATTVTVNAA VDMDLTGSSF DKATKMVVTG AGKVTLDGDD VASVLAEVDA
SGATGDVDAS GVALTAAGVY TGGSGADTFT VTAAATKAST GGDGDDVITV STLGTGGSVD
AGAGTDTLVM AADDAETASA TAGFKATNFE KLSIGAVADA TAGADTETVN LANLAFTEVT
SAGAVDGDED ILALTNAASG LTLNITGEGL FTVGVKDAAT NKTDVLNIAS NSDGNLALGT
VTAANVETIN INAVDKVVDT TGATDAFGAA IGDGKDDNNS VQTLTVDAGA ATTVNVSGSA
DLDLTVNTAT ALTAVNASTA TGAVTYTAND GTTTVTGGSG NDNLTAAGDS DVLLGGAGND
KLTAATLTTL TGGEGNDTFV MSGAVDATKY STITDLSSGD TIDTDATAFN SSKVVLAANA
TFAQYLDAAI VATAAQDDAA SWFQFGGNTF IVNEGTDASV TYNAAEDGII GITGLVDLSA
ATFNATNGTF DIA
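
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This fits translation table 11, in which TTG is one of the permitted alternative initiation codons and is read as methionine at the start position. The sketch below illustrates the relationship on the first and last blocks of the gene sequence only. The function name translate_cds and its parameter are hypothetical helpers written for this example. The codon table is the standard one, whose amino acid assignments table 11 shares, and the start-codon handling is deliberately simplified: the first codon is read as M without checking that it is a valid initiator.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); table 11 uses
# the same amino acid assignments and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna, starts_at_initiator=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Hypothetical helper. Whitespace in the input is ignored. When
    starts_at_initiator is True, the first codon is reported as M
    (simplification: no check that it is a valid table 11 start codon).
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        if i == 0 and starts_at_initiator:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "TTGGCCTTTT GGGAAGGGCA GCTATCTTCC"
# Last 12 bases of the gene sequence above, including the TAA stop codon.
last_block = "GATATCGCGT AA"

print(translate_cds(first_block))                             # MAFWEGQLSS
print(translate_cds(last_block, starts_at_initiator=False))   # DIA
```

The two outputs match the first ten and last three residues of the protein sequence listed above.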