Gene TM1040_3334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3334 
Symbol 
ID4075233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp344066 
End bp345592 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content58% 
IMG OID638004842 
Producthypothetical protein 
Protein accessionYP_611568 
Protein GI99078310 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.881779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.501512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG ATCTAATCCT TCACGGCGTT CCAGATCGCA TCAGGTTGGA CGGGTGGTGG 
CTCGAGACGA CAGAAACACC AGAGCCGGAC ATCGCGCCGC CTATGAGGGC AAATGGCTCG
TTGATGGCGT CATTGCGCAT GCGGCAGGTG ATCCCCCTGC TTGGCGCGCT TGTGGCACTT
GCGGATTGGC TGTTCTGGCA TCAGCCGGTC GGTCTCTCAC TCGCCATTTT TGCCGTCGTT
GTCTCCGCAG CGATTCTGGC GGTGAAACCT GAGCGACCAA GCTTACGGAG CTGGGGCCTC
GCCATGGGGT TTGCGTTGCT TTGCAATCTG CCAGTGGTGA TCGAGCTTCA ATTTCTGTCA
CTCCTGTTTA GCCTCGGAGG TCTCATCACG CTTGCGGCTT GGGCGTTTGC GGGGTCCAGC
TTGACGACGG GGCTCATACT GAGGATGGCA CTTCGCCTCC CTGCCTTTGG GCTAGTACAT
TTGGTGAAAG ATACCGCCGA TGCGCTACCG CCTGCAACAT ACTCGTCACG GCTCCGGTAC
ATGGCGGCCA CGCTGCTATT GCCGCTTCTG ATGGGGGCCG TGTTTCTGGG CCTTCTCGCA
AATGCCAACC CAGTCCTACA GGCGGCGCTG GACAGCATCG ATCTGCGCCA CCTGCTCAGG
GCTGAATTTT GGACGCGCTT TCTGTTCTGG GGGTGCGTGG CGTCACTCCT CTGGCCGCTC
CTCAATCTGA GCGAGTCATG GATTGGGGCA CAAGCCCGCC GAGCGCGCGT AACAAAGGCG
GGTCCGCACC GTAGCAGCTT CTTGATCAAT CCGCTTTCCG TGCGCAATTC GCTTTGGCTG
TTCAATTTGA TGTTTGGCAT CCAGACCCTG ATGGACCTCA GCATATTAAC CGGCGGAGTG
TCGCTGCCCG AGGGCATGAG CTATGCCTCA TATGCACATC GCGGCGCCTA TCCCCTTGTG
GCAACGGCGC TGCTCGCCGG ACTCTTTACA CTGCTCACGC GAAATATGAT TGGCCAAGAC
AAGGTTCTGC GGTCTCTGGT CTATCTGTGG CTGGCGCAGA ACATGATACT TGTTGCAACA
GCCGCGATCC GATTGCAGCA CTATGTTGAG GCCTACGCCC TTACCTACCT GCGTGTCGCG
GCATTTATCT GGATGGCTCT GGTTCTGACA GGGCTGCTGT TGACGATCTG GCAAATCCAT
CGCGGGTTTG GGACATCATG GCTATTGCGA CGGTGCTTGG CTGCGCTCGC CATCACGCTC
TACCTCTCCA GCCTCACAAA TTTTGCCGAC ATAGTCGCCA GATATAATCT CACCCATGGC
AGCGCGCTTC GGGGGCCTGA CACCTACTAT ATCTGCAGTC TCGGTCCGGG GGCCTACCGC
ACGATACTGG ATCATGAAGC AAGCACCGGA CAGGATATTT GCACACGCAT GATTGAACGC
GACCTCGAGC GCATTTCAAT CCGGAACTGG CGCGAATGGG GCTATCGGAT GTGGCGGCTT
GAGGCCTATG ATCGGGCGCA GAATTGA
 
Protein sequence
MAQDLILHGV PDRIRLDGWW LETTETPEPD IAPPMRANGS LMASLRMRQV IPLLGALVAL 
ADWLFWHQPV GLSLAIFAVV VSAAILAVKP ERPSLRSWGL AMGFALLCNL PVVIELQFLS
LLFSLGGLIT LAAWAFAGSS LTTGLILRMA LRLPAFGLVH LVKDTADALP PATYSSRLRY
MAATLLLPLL MGAVFLGLLA NANPVLQAAL DSIDLRHLLR AEFWTRFLFW GCVASLLWPL
LNLSESWIGA QARRARVTKA GPHRSSFLIN PLSVRNSLWL FNLMFGIQTL MDLSILTGGV
SLPEGMSYAS YAHRGAYPLV ATALLAGLFT LLTRNMIGQD KVLRSLVYLW LAQNMILVAT
AAIRLQHYVE AYALTYLRVA AFIWMALVLT GLLLTIWQIH RGFGTSWLLR RCLAALAITL
YLSSLTNFAD IVARYNLTHG SALRGPDTYY ICSLGPGAYR TILDHEASTG QDICTRMIER
DLERISIRNW REWGYRMWRL EAYDRAQN