Gene Anae109_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2471 
Symbol 
ID5377763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2863226 
End bp2864317 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content75% 
IMG OID640843990 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001379656 
Protein GI153005331 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.325849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGGC CCGCCCTGCG CTGGACCCTG TTCCACCTCG ACCCCGAGCG CGCCCACCGC 
CTCGCGCACG GCGCGCTGCA CCGCGTGCCG CCGGGGCTGG CGCGGCTGCG CCGTCCCGCG
GTGCCGCCGG AGCTCCGCGT CTCCTGCCTC GGGCTCGACT TCGACGGCCC CATCGGCCTC
GCCGCCGGCT TCGACAAGGG CGACGCCTCG ATCGCGGGGC TCTTCGCCCT CGGCTTCTCG
CACGTGGAGA TCGGGACCAT CACCCCGCGG CCGCAGGCCG GCAACGAGCC GCCGCGGCTG
TTCCGCCTCG TCGAGCACCG CGCCCTCGTC AACCGGATGG GCTTCAACAA CGCCGGGGCC
GAGGTGTGCG CGCGCCGCCT CGCCGGCGTC CCCGCCACGG CGCGGATGGG CCCGGTGGGC
GTCAACGTCG GGAAGAACAA GACGACGCCC AACGAGGACG CGGCGGCGGA CTACCTCGCC
TGCATCGACC GGCTCCACCC GTACGCCGAT TACCTCGTCG TGAACATCTC GTCGCCGAAC
ACCCCGGGGC TGCGCCAGCT CCAGGAGCGC GACCAGCTCG ACGCGCTGCT GCGCGCCTGC
GCGGGGAGGC TCCGCGAGCG GGCGCCGGGC AAGCCGCTCC TCGTGAAGCT CGCCCCCGAC
CTCTCCCCGA CCGCGCTCGA CGAGGCGGTG GACGTGGCGA TCGACGCCGG GGTGTCCGGC
ATCGTCGCGA CGAACACGAC CCTTTCGCGG GCGGGGGTCG AGCGTCACCC ACGCGCCCGT
GAGGCCGGCG GGCTCTCGGG AGCGCCGCTC GAGGCACTCG CCACGAGCGT GGTGCGGCGC
TGCTACATCC GCGCGGCGGG TCGGGTGCCC ATCGTCGGGT GCGGCGGCGT GATGAACGCG
GAGGGCGCCT ACGCCAAGAT CCGCGCTGGC GCGACGCTCG TGCAGGTCTA CACCGGCCTC
GTCTACGGCG GGCCGGGGTT CGTGCGGCGC CTGAACGACG GCCTCGCGAG GCTGCTCGCC
CGCGACGGCT TCCGCACCGT CGCCGAGGCG GTGGGCGCCG ACGTCGAGAC GGCCGAGCGG
GCAGGCGTCT GA
 
Protein sequence
MIWPALRWTL FHLDPERAHR LAHGALHRVP PGLARLRRPA VPPELRVSCL GLDFDGPIGL 
AAGFDKGDAS IAGLFALGFS HVEIGTITPR PQAGNEPPRL FRLVEHRALV NRMGFNNAGA
EVCARRLAGV PATARMGPVG VNVGKNKTTP NEDAAADYLA CIDRLHPYAD YLVVNISSPN
TPGLRQLQER DQLDALLRAC AGRLRERAPG KPLLVKLAPD LSPTALDEAV DVAIDAGVSG
IVATNTTLSR AGVERHPRAR EAGGLSGAPL EALATSVVRR CYIRAAGRVP IVGCGGVMNA
EGAYAKIRAG ATLVQVYTGL VYGGPGFVRR LNDGLARLLA RDGFRTVAEA VGADVETAER
AGV