Gene TM1040_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3785 
Symbol 
ID4074880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp33104 
End bp35482 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content52% 
IMG OID638004445 
Producthemolysin-type calcium-binding region 
Protein accessionYP_611180 
Protein GI99077921 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0346171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATA ACGACAGTGG CACTCCAGCT ACCGAAAGCA ATCAGACACT CACAGGAACC 
GGAGCGGACG ACCAGCTATC TGGCGGGCTT GGTGATGATG TCATCACGGG TGGCGCGGGT
GACGATGTTC TGCGTGGTGA CGGTCCAATT GCAGGTTCTT GGCACTATGA AACCTTTGAT
CGCAATTTTA CCTCTGCGAA TGGGCAAGCT TTTGACTTTG GAGGTGATGC AAGCGAGCGC
ACAGGGTCCG GTTATGTCAC TGACTTTGAT GAATCCAGCC TAACAAACGC TATTCGCGGA
ACGACAGGAA ACCCAGGCGA TTTCGGTGTT ATCTACACCA GCACGCTGAA TATTACCGAT
AGTGGCGTCT ATACGTTCGC GACCAATTCT GACGACGGTT CGACAATCCA GATATTCGAC
AGCGCAGGTA CTCCCGTCAC GTTCACGCAG GGTGGGACAA CTGCGGACTA CATGAACAAC
GACTTTCACC AGCCAGCTAC GACAAGATCT GGTCAAGTCA CGCTCGACAG TAATGAAACT
TATACGATCC AGGTTCGGTA TTGGGAAAAT CAGGGTGCAG ACGTTCTTCA GGTTACAGTC
GATCCGCCTG GGTCGACGGG GGCGGTCAAT ATAAATGACA GTGGCCTGGT GGGGGTGCCC
CCGGACCCTG ATTATTCGAC AACGGGCGTA CCAGCTGGCG TTGAGGGAGA CGACTTTCTC
AGCGGAGGTG CGGGAAATGA CACCCTTTGG GGCGATGGAG GCGACGACAC GCTCAACGGT
GGTGCGGGTG ACGACATCAT TGATGGCGGC AGGGGGGCAG ACACGATCGA TGGTGGATTT
GGGCTCGATA CCATCGACGG GGGCGAGGAA GACGACATCA TTGCAGGTGG TGAAGGCAAT
GATGTGATCA CTGGTGGTGT TGGTGATGAC ATCATATTTG GAGACGAGTT CGACCCCGGT
GCGGGGAATG AATATTTTTA TGTGGGCAAT GCCTACATTG CCGACGAAAA CTATACCATC
ACCGGGTCAG GCGATTTTGC GGGCCCCACA GCGAGTGAAA TTCAAGTCAC AACTGATCCC
GTCACAATAC GGTTTCTGGA TGATGACACA TCCTTGGGTG GTGATGACGG GGCAAATGAA
GTCAGGGCAG ACCCGACATT TCAGAGGGTT GAGATTGATG GGGTCGAATA TGGGGCGAAT
CTGGATTTCA GCGTTACGTT TGAAGATCCC ACGGGCACTG TCTACCGGTT TGCAGTGCTC
GATGTCGACT TTGATGGGGA CGGCTCCAAC CAAGATGCCA ATGAAGACGG CCATATCCTC
ATACAGATCG ATGGCCCGGC CATCGTGCCC GGAACTGTTC TTGATACTCG TAGTGGGTTT
CAGAACATTT CCGCAACCGA CTACGACGCT CTGAGAGCAG CGACTGCATT TAACGACACC
ATTGATGGCG GAGATGGTAA TGACACAATC GAAGGTGGGC GTGGCAATGA TGCCATCACA
GTAGGTCGTG GTGATATAGC TACCGGTGGT GCGGACGCTG ATACTTTTAC GTTGGATTTT
GGTCAAACCA GCACTGACGG ATCAATGGTC GTTACCATTG ACGGCTCAAC TACCGAGATC
GACGGAGTTG ACAATGATAG CTTGGACCTG ACTGGTTTTG GTACGTTCAC TCTCACACAG
ACCACCGACA CAGACGGTAA TTCAACAAGC GGTACAGCAG TTTATGCAGA TGGAACGACT
GTCAACTTCA CCGAAATTGA AAATCTGATT GTGTGTTTCG CAAAAGGAAC ACAGATCAAG
ACAAGTGATG GTATTCGCGA CATTGAGAGC CTGCAAGTTG GCGACAAGCT CGTAACCAAA
GACAATGGCC TGCAGTCTAT TCGGTGGATC GGCAAGCGGA CGCTGAGTGA AGACAAACTA
GACGCCCACC CAAAGCTCAA ACCCATCCGC ATCAAAGCGG GCGCATTGGG CGAAGGGATG
CCTTCACGTG ATCTGATTGT TTCTCCACAA CACAGGATCG TTGTTCGGTC CAAGATCGCA
ATCCGAATGT TTGACACGCA AGAAATACTT GTGCCTGCAA AGCACCTGAT TGGCCTACAG
GATGTGTCGA TTGCTGCCGA AATGCGCGAA GTGACTTATT ATCATTTGTT GTGCGACAAT
CATGAGATCA TCGAGGCAGA TGGCGCATTC GCTGAGACAT TATACACGGG TACAGAGGCG
ATGAAAGCGA TGTCGCCAGA GGCACTAGAA GAGATTATGC TGATCTTGGG AGATCAGTTC
TCGACGCGCC GACCATTGGC CAGGTTCACG CCAAAGGGTC GGCAGGCCAA GAAATTGATA
GAACGTCATA TCAAGAATGA TCGAGCGGTG TATTGTTAA
 
Protein sequence
MADNDSGTPA TESNQTLTGT GADDQLSGGL GDDVITGGAG DDVLRGDGPI AGSWHYETFD 
RNFTSANGQA FDFGGDASER TGSGYVTDFD ESSLTNAIRG TTGNPGDFGV IYTSTLNITD
SGVYTFATNS DDGSTIQIFD SAGTPVTFTQ GGTTADYMNN DFHQPATTRS GQVTLDSNET
YTIQVRYWEN QGADVLQVTV DPPGSTGAVN INDSGLVGVP PDPDYSTTGV PAGVEGDDFL
SGGAGNDTLW GDGGDDTLNG GAGDDIIDGG RGADTIDGGF GLDTIDGGEE DDIIAGGEGN
DVITGGVGDD IIFGDEFDPG AGNEYFYVGN AYIADENYTI TGSGDFAGPT ASEIQVTTDP
VTIRFLDDDT SLGGDDGANE VRADPTFQRV EIDGVEYGAN LDFSVTFEDP TGTVYRFAVL
DVDFDGDGSN QDANEDGHIL IQIDGPAIVP GTVLDTRSGF QNISATDYDA LRAATAFNDT
IDGGDGNDTI EGGRGNDAIT VGRGDIATGG ADADTFTLDF GQTSTDGSMV VTIDGSTTEI
DGVDNDSLDL TGFGTFTLTQ TTDTDGNSTS GTAVYADGTT VNFTEIENLI VCFAKGTQIK
TSDGIRDIES LQVGDKLVTK DNGLQSIRWI GKRTLSEDKL DAHPKLKPIR IKAGALGEGM
PSRDLIVSPQ HRIVVRSKIA IRMFDTQEIL VPAKHLIGLQ DVSIAAEMRE VTYYHLLCDN
HEIIEADGAF AETLYTGTEA MKAMSPEALE EIMLILGDQF STRRPLARFT PKGRQAKKLI
ERHIKNDRAV YC