Gene TM1040_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0041 
Symbol 
ID4076308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp43680 
End bp44879 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID638005328 
ProductHI0933-like protein 
Protein accessionYP_612036 
Protein GI99079882 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.32138 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGACT GGGATGCGGT CGTGATTGGA GGCGGTCCGG CTGGATTGAT GGCAGCAGGG 
GAGGTCGCGC GTCAGGGCCA TCGGGTGCTC TTGGTGGAAG CCAAGCCGTC GCCTGCGCGC
AAGTTTCTGA TGGCCGGAAA ATCCGGTCTA AACCTGACCA AAGACGAACC CTTCGAGGAT
TTGCTGGCGC AGTATGGCGA CTCGGCTGAG TGGCTGGCAC CGATGATCAA GGCGTTTGAC
GCTGCGGCTG TGCAAGACTG GGCGCGCGGG CTCGGGCAGG AGCTTTTTAC CGGGTCGACG
GGACGGGTGT TTCCCACGGT CATGAAGGGC TCTCCCTTGT TGCGGGCGTG GCTTCAGGAT
CTGGACACAC ATCGCGTTAC CCGGCAACTC GGCTGGCGCT GGACAGGTTG GCAGCAGGAC
GGGCAGCTGT TGTTCGGTAC GGCTGCAGGG CCACAAATCG TGACATCCCG CGCCACCATA
CTGGCGCTTG GTGGTGCGAG CTGGGCACGG CTTGGTTCGG ATGGCGCCTG GGCGGCCCTA
TTGGCGTCGC GCGGCGTCGC CTTGGCACCG TTTCAACCGT CGAACGCTGC CCTTTCGGTC
GCCTGGAGTG ATCACATGAC GCCGCATTTT GGCGCAGCAC TCAAGGCAGT GGCCTGGCAG
GCAGGCGCAT TGCAGGCCCG TGGCGAGGCG ACCTTGTCGC AACGCGGGCT CGAAGGCGGT
GGGCTTTACA CGTTGACCCC AGCTCTGCGC GAAGGGCAGC CGCTTTTTGT CGACTTGTCG
CCGGACCTCA ACGAGGGCGA TCTTGCCCGG CGGCTCGCGA AACCGCGTGG TAAGACGAGC
TGGTCGAACC ACATGCGCCG CACGCTCAAG CTTGCGACGG TGAAAATGGC ACTATTGCAA
GAGTTTGGCC GCCCGCTGCC GCAGGATCCA GAGAGCCTGG CCCGTCTCAT CAAACATTTG
CCCGTGCGCC ATACGGGGTT ACGCCCAATG GACGAGGCAA TCTCGACGGC TGGCGGTGTG
CGCCGCGATG CCTTGGATGA CGGCCTCATG CTAAAGGCGA TCCCCGGCAC GTTTTGCGCC
GGAGAGATGC TTGATTGGGA TGCGCCAACG GGAGGGTATC TCTTGACTGC CTGCTTTGCG
ACCGGCCGTT GGGCAGGGCA AGCGGCGGCG CGCTACTTGG CGAGTTCCGC CACGCGCTGA
 
Protein sequence
MQDWDAVVIG GGPAGLMAAG EVARQGHRVL LVEAKPSPAR KFLMAGKSGL NLTKDEPFED 
LLAQYGDSAE WLAPMIKAFD AAAVQDWARG LGQELFTGST GRVFPTVMKG SPLLRAWLQD
LDTHRVTRQL GWRWTGWQQD GQLLFGTAAG PQIVTSRATI LALGGASWAR LGSDGAWAAL
LASRGVALAP FQPSNAALSV AWSDHMTPHF GAALKAVAWQ AGALQARGEA TLSQRGLEGG
GLYTLTPALR EGQPLFVDLS PDLNEGDLAR RLAKPRGKTS WSNHMRRTLK LATVKMALLQ
EFGRPLPQDP ESLARLIKHL PVRHTGLRPM DEAISTAGGV RRDALDDGLM LKAIPGTFCA
GEMLDWDAPT GGYLLTACFA TGRWAGQAAA RYLASSATR