Gene TM1040_2841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2841 
Symbol 
ID4076660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3008991 
End bp3010541 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content60% 
IMG OID638008170 
Producthypothetical protein 
Protein accessionYP_614835 
Protein GI99082681 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.876193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.631285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAT TGATGTCCGT CGAGTGGTTC ATTCTGGTGA CCCTCTGCAT CGCCAGCGCA 
GCGGTTGCGG TCTGGTTTCT GAACCCCAAG GCCGAGGCCA AGGGGATGGA GCACGAATTG
CTGATCGCCG ATGGCCGCAC GGATGCGGTT TTCCTATTTG ACGACCAGAC GCTGATTGCC
TGGTCATCGG GCGCGCGCAA GTTCCTCGAT GACAATGCCG AAGATTTCAG TTGGGGCAAG
CTGCGGGACA AGCTGTCGCG CTCCTACCCG GGCCTACCGC AGTCACCCGG TTTCCTGCGC
GATGTCGGCT CCCTGACACT ATCTGGCACT GCAGAGGCAG ACAACCGCGA GGCCCATTGC
GAATGGATCG ACGGCGTTAC CCGTATTCAG CTGCGCCGCG CCGCCTCGGA CATCGGCCAC
GTCGGCGATG ATCAGGAACT GACCACCCTG CGTGCAGCCG TGCATCAGGC CCCCTACCCC
GTTTGGCTGC AATCCGATGA TGCCCGCGTC ACCTGGACCA ACCTTGCCTA CGATCGTCTG
AACCAGAAAA TCCGTGGCCG CGGCAATGAT ACCTCTGCAC CTCTGTTTCC AAACCTCGAC
GCGCCGATGA ACAGCGGTCG CGCGGAACGG ATCTCCATCG AGCTGCCGGA AACCGACAAG
AAGCTCTGGT ATAATGTCTC CACCACCGAG ACCGAAGCCG GTTGGCTCTG TCATGCCATG
GACGTCAACG CCGTGGTGGA TGCTGAGATC GCGCAACGCA ACTTTGTGCA GACGCTGGCA
AAAACCTTTG CGCAGCTTTC CATCGGCCTT GCGATCTTTG ACCGCAACCG CCAACTGGTC
CTGTTCAACC CCGTCTTGAT CGACCTCACC GCCCTGCCGG CGAGTTTTCT GAGCTCGCGC
CCCAACATGA TGTCCTTCTT TGACCGCCTG CGCGACCAAC GCATGATGCC GGAACCAAAG
AACTACTCGA GCTGGCGTCA CCAGATGGCG GATCTCCTGG AGGCTGCCGC CGAGGGCCGC
TATCAGGAAA CATGGTCTTT GCCCTCCGGG TCGGTCTATT CGGTCTCTGG CCGTCCACAT
CCTGATGGGG CCATCGCCTT TCTGTTTGAA GACATCACCG CCGAAATCAC TCTGACCCGT
CAGTTCCGCT CTGAACTGGA GTTAGGCCAG TCGATCCTCG ATCATATCGA CGACGCCTTT
GCGGTCTTCG GCGCCGACGG TAGCATGGTC TATTCCAACA CCGCCTACCA CACGATGTGG
AAAACCGACC CCGACGCAAG CTTTGCCAAA ATCACCATCA CCGATGCCTG CCGCACCTGG
CAGGAGCTCT CGGGACCAAC CCCAACCTGG GGTGAAGTGC GCGACTTTGT AGCCGGTGGC
ACCAGTGATC GTTCTGATTG GTGGTCTCGC GTGGACATGC GCAACGGTGA GCAGTTGCTC
TGCCGCGTGC AGTCGCTCCA GAACGGCTCC GTTATGGTCC GTTTCCAGCA GATCGACCTG
CCGCTGGCGC CGCCAGAACA AGAGCGTTTC GCCGTCCTCA AGGATAGCTG A
 
Protein sequence
MQELMSVEWF ILVTLCIASA AVAVWFLNPK AEAKGMEHEL LIADGRTDAV FLFDDQTLIA 
WSSGARKFLD DNAEDFSWGK LRDKLSRSYP GLPQSPGFLR DVGSLTLSGT AEADNREAHC
EWIDGVTRIQ LRRAASDIGH VGDDQELTTL RAAVHQAPYP VWLQSDDARV TWTNLAYDRL
NQKIRGRGND TSAPLFPNLD APMNSGRAER ISIELPETDK KLWYNVSTTE TEAGWLCHAM
DVNAVVDAEI AQRNFVQTLA KTFAQLSIGL AIFDRNRQLV LFNPVLIDLT ALPASFLSSR
PNMMSFFDRL RDQRMMPEPK NYSSWRHQMA DLLEAAAEGR YQETWSLPSG SVYSVSGRPH
PDGAIAFLFE DITAEITLTR QFRSELELGQ SILDHIDDAF AVFGADGSMV YSNTAYHTMW
KTDPDASFAK ITITDACRTW QELSGPTPTW GEVRDFVAGG TSDRSDWWSR VDMRNGEQLL
CRVQSLQNGS VMVRFQQIDL PLAPPEQERF AVLKDS