Gene TM1040_2971 details


Gene Information

Locus tag: TM1040_2971
Symbol:
ID: 4078001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 3136295
End bp: 3138247
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 54%
IMG OID: 638008300
Product: hypothetical protein
Protein accession: YP_614965
Protein GI: 99082811
COG category:
COG ID:
TIGRFAM ID:
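
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `coding_length_aa` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3136295
END_BP = 3138247
GENE_LENGTH_BP = 1953
PROTEIN_LENGTH_AA = 650


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3138247 - 3136295 + 1 gives 1953 bp, and 1953 / 3 - 1 gives 650 aa, which agrees with the values in the record.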


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.123475
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTTA CACCGCTTAT TCCATCATCT GGGGTTGCGG GCTGGAATTT TCTGCAATCG 
ACCTATGACC GGCAATACGA TGCGTTTGTC CAGTCCGGGA AGTTGAAGAA TGACAGTGAG
TACTTCGCCG AGAATATCGG CGAAGTCACG TCCAGCGAAG ACTTGCTTAA TGACCGCCGC
CTGCTTCAGG TGGCGGTTAA GGCGTTTGGT CTGGAAGAGG AAATCAACTA CCGGGCCTTG
CTGCAGCGTG CGCTCAACGA AGGCACCTCT GCCAGCGATG CGCTTGCCAA CACAATGAAT
GACGAGCGCT ACGTCGAATT CTCCAACGCC TTTGGATTCG GCCCGGGTCA ATCACCGATG
ACCAGCGACA GCAAAGCAAT GCAAGCGGTG ATCGACAAGT TCCAGTCTGC CTCCTTCGAG
GAAGCGGTGG GAGAAGTCGA TGAAACGATG CGCACCGCGC TCTTTGCCAA ACGGGCCATG
ATCGAGGTCT TTGGCGAGCC GGACGAGGAT GACGTATCGC AGCTGAGCGT GAGAGAACGC
GCCCTTCGCG AGTTTGACCT CGCCATGAAG GAGATCAACG GCAAAGACGA TGATGTCCCT
GGTGTGACCT CTGTCGAGGA CCAGTGGGAA GATATTATCG AGCGCGACAC GCTTCGAGAA
TTCTTTGATA CCACGCTCAG AATCTCTGCA GGTGCCGCTG GACTTGAAGA CGACGAACGC
ATCCAGCTTT ATCGAGAACG TGCGCAGATT ATCTTTGGAA CCGACGATCC AACGGTCTTC
TTCTCGGCCG AGAACAAAGA CACAATCATT TCCGCCTTTA AAACCCGTGC AACCGTGAAC
GGGGATGATG CAGCTGAAAC CGCAAAGACT GCGGAACTGT CAGAACACAT CCTGGATCAG
ATGATCTCTC GCGATGCCGC GCTGAATGAA GAATGGGATT TCATCTCCCG GCAGGAACCC
CTGGCGGAGT TTATGAAAAC CGCTCTGGAA TTGCCTGACG ATATCGCGAC ACGAGAGACC
AGCGAAGCCA TGCGGATCTA TCGTGAAAAA GCCATTGAAG CCTTTGGCAC AGATGACCCC
AATGTCTTTG CTAGTGCAGC AAACTTGGAT GCAACACTCG AAGTTTATCG GGAAAATGCC
ACGAACGCTG GTCTCTCCAG TAGCGAAATC TCTTCCAATC TGCGCACGGC TGAGACCATC
CTAAAATTCT CCTACAATCA GGGGGATGCG ATCGATGGTG GTGATGCGGA TGCCGCGGCA
ACCGCGGAAA TAGATGCCGG GTGGTACAGT GTCATGGGTC AAACCGGCGT CCCGCGCTTC
TTGAACACGG CCCTTGATGT GTCCTCTGCG CTTGCGCCGG GTGAAATCTT TTCGGACCTG
ACGATCGACG AGCAGCTAAC AATCTACAAA GATAAAGCGG TTGAACTGTT TGGGACCGCT
GACCTCAAAG AGCTGACCGG CCCGTATCAA ATCGGCGCAG TGAACGATGC CTATCGCACC
AACGCGGAAG CTGCGGGTGA GAGCGAATTC TTCATTACCT ATTATTCCGA TATCGCCGAG
CGAGAACTAA ACACCTTGTT CCAGAGCGAC GATACCGATG CTGAAAAAGC GTTTGAAGAA
GCTCTCGAAG AACTGCGCAC GATGAATGAT GAAGACAATG GTCCAAGCGC CAATTCGCAG
TGGTTCACGA TCATGGGCCA GCAGGCGATG ACCGACTTCA TGCAAGTCGC GTTGGGCCTG
CCCAAAGAGG TTGGTCAAAT GGATATTGAC CAGGCTGTCG AGGTCTACAA GCGCAAGGCA
CAACAAGTTC TGGGGACCGA CAAACCCTCG GAATTCATCT CTGGCGACAA GATGGATGAG
CTGGTCAACA TGTATCTCAC CCGCTCGCAG ATGAACAATC TCAGCAGCGG ATATAGCTCC
GGCAGCGCCG CCCTCATGAT GCTGCGCGGC TAA
 
Protein sequence
MTFTPLIPSS GVAGWNFLQS TYDRQYDAFV QSGKLKNDSE YFAENIGEVT SSEDLLNDRR 
LLQVAVKAFG LEEEINYRAL LQRALNEGTS ASDALANTMN DERYVEFSNA FGFGPGQSPM
TSDSKAMQAV IDKFQSASFE EAVGEVDETM RTALFAKRAM IEVFGEPDED DVSQLSVRER
ALREFDLAMK EINGKDDDVP GVTSVEDQWE DIIERDTLRE FFDTTLRISA GAAGLEDDER
IQLYRERAQI IFGTDDPTVF FSAENKDTII SAFKTRATVN GDDAAETAKT AELSEHILDQ
MISRDAALNE EWDFISRQEP LAEFMKTALE LPDDIATRET SEAMRIYREK AIEAFGTDDP
NVFASAANLD ATLEVYRENA TNAGLSSSEI SSNLRTAETI LKFSYNQGDA IDGGDADAAA
TAEIDAGWYS VMGQTGVPRF LNTALDVSSA LAPGEIFSDL TIDEQLTIYK DKAVELFGTA
DLKELTGPYQ IGAVNDAYRT NAEAAGESEF FITYYSDIAE RELNTLFQSD DTDAEKAFEE
ALEELRTMND EDNGPSANSQ WFTIMGQQAM TDFMQVALGL PKEVGQMDID QAVEVYKRKA
QQVLGTDKPS EFISGDKMDE LVNMYLTRSQ MNNLSSGYSS GSAALMMLRG