Gene TM1040_3654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3654 
Symbol 
ID4075623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp708264 
End bp709220 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID638005174 
ProductUBA/THIF-type NAD/FAD binding fold 
Protein accessionYP_611883 
Protein GI99078625 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCT ATGCTCGCCA GATGATGCTG CCCGAGGTGG GGGCTGTAGG GCAGGCGCGT 
CTCAGCACTG CGCGCGTTCT GGTTGTAGGG GCAGGGGGTC TCGCTGCGCC GGTTCTGCCG
CTCTTAGCGG GGGCCGGGAT CGGACATATC ACCCTGATCG ACGGCGATGT TGTGAGCCTG
TCCAATCTGC ATCGCCAGAC CTTGTTTCAA GAAACCGACT GTGGCCGTCC CAAAGCCGAA
GTCGCCGCGC AGCGCTGCAG CGCCCTCAAC AGTGAAATTG AGATCGTCGC GGTTGCACAT
GCGCTCACTC CAGCCAATGC GCCGCTCGTT CTTGCAGATG TGGATCTCGT GCTCGATTGT
GCGGACAGCT ATGCCGTGAG CTACCTCCTG AGCGATCTCT GCCATGCCCA GAAGACTCCG
CTTATCAGCG CTTCGGTGCT GGGGTCAGGC GGATATGTTG GCGGTTTTTG CGGTGGGGCG
CCTTCCTTGC GGGCGGTGTT CCCCGATGCC CCCGACAACA GTGCCAGCTG TGAGACGGCA
GGTGTCTATG GCCCTGTGGT TGGAATGATT GGCGCGTTGC AGGCTCAGAT GGCGCTCAAT
ATTCTCTTGG AACATGTGCC CTCGCCTCTG GGCCAAATGG TACAGCTGGA TTGTCGCAGC
TATCGCTCGA CGACCTTTCG TTTCGACCAT GCGCCCGAAC CCGAGGTGAG CTTTCCGTTT
GTAGCCATCG AAGAGCTACA GGCAGATGAT CACATCATCG AGTTGCGCGC AGATGCGCCA
CTGCTTCACC CAAAGGCCAG GCGATCGGAC GCAGAGATGC TGTTGCAGAC CCTGCCCAAT
CCCCAAAAAC GTTTGGTGCT GTGCTGCGCC ACGGGTCTGC GGGCCTGGCG CACGGCAGAA
AGAATTCACC CCATCTGGCC GGGCGAGATC GTTCTCGTCG CGGCCTCCGC ATCCTAA
 
Protein sequence
MSRYARQMML PEVGAVGQAR LSTARVLVVG AGGLAAPVLP LLAGAGIGHI TLIDGDVVSL 
SNLHRQTLFQ ETDCGRPKAE VAAQRCSALN SEIEIVAVAH ALTPANAPLV LADVDLVLDC
ADSYAVSYLL SDLCHAQKTP LISASVLGSG GYVGGFCGGA PSLRAVFPDA PDNSASCETA
GVYGPVVGMI GALQAQMALN ILLEHVPSPL GQMVQLDCRS YRSTTFRFDH APEPEVSFPF
VAIEELQADD HIIELRADAP LLHPKARRSD AEMLLQTLPN PQKRLVLCCA TGLRAWRTAE
RIHPIWPGEI VLVAASAS