Gene Bind_1068 details


Gene Information

Locus tag: Bind_1068
Symbol:
ID: 6199300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 1225574
End bp: 1226659
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 60%
IMG OID: 641705061
Product: dihydroorotate oxidase
Protein accession: YP_001832200
Protein GI: 182678054
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
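
The coordinates and lengths reported above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace in the input is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


if __name__ == "__main__":
    length = gene_length(1225574, 1226659)
    print(length)                  # 1086
    print(protein_length(length))  # 361
```

The same `gc_fraction` helper could be applied to the gene sequence listed in the Sequence section to compare against the reported GC content.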


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.108109
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTCT TGGCGCCGCA TCGGCTGCTC TTCGGCCTCG ATCCCGAGCG GGCGCATCGC 
CTGGCGATCT GGGCGCTCGC GCATGTTCCC CTCCCGCCGC CTCCGGCCGT TGATCCCAGG
CTGTCGGTCG AGGCTTTTGG CCTTGCCTTT GCCAATCCGC TCGGCCTTGC CGCGGGCATG
GACAAGAATG GCGAGGCGCC GGGCGCGCTT TTGCGGCTCG GTTTTGGTTT TGCCGAAATC
GGCACGGTGA CACCGCTGCC GCAGCCGGGC AATCCGTTGC CGCGCCTGTT TCGCCTGACC
AAGGATGAGG CGATCATCAA TCGTTTTGGC TTCAACAGTG AAGGCCATGC GCGCGTACAT
GAGCGGCTTA CGGCTTTCTT GGCAGGCGCG CGGCGGAAAG GCGTGATCGG CATCAATATT
GGCGCGAACA AAGACAGTCT CGATCGCGCC GAGGATTATG TGAAAGGCAT TCATGCCTTT
GCCGATTGTG CGGATTATTT CACAATCAAT ATTTCCTCGC CCAATACGCC GGCTTTGCGC
GATTTGCAGC AGCCCCTTGC GCTTGATGAT CTTGTGGTGC GCGCGCTGGC TGCGCGTGAT
GCCGAGGCGG AGCGCCATGG CCGCAAGCCG GTGCTGGTGA AGATCGCACC CGATCTGACA
TTGTCCGAAC TTGACGCGAT CGTGAAAATC ACCTGCGCAC GAAAAGTCGA TGGGCTGATC
GTCTCCAATA CGACGCTCAG CCGGCCTTCA GATCTTCAGG ATCCGCAAGC GCGTGAGGCG
GGAGGTCTTT CGGGCAAGCC GCTCTTTGAT CTCTCGACAC GCATGCTCGC TGAAACCTTT
GTGCGCGTCG AGAACCAGTT TCCACTGATC GGCGTGGGCG GGATCGACAG CGCGGAAACA
GCACTCGCCA AGATCGAGGC AGGGGCTACT CTCGTGCAGC TTTATTCGGG CCTCGTTTTT
CAAGGCCCGG GGCTGGTCAG GAGGATTCTT GGCGAATTGC CGCGCTTGTT GACAGCGCGA
AATTATCCAC GGCTCGTGGA TGCCGTGGGG GCTGCCGCGG TCGATCGGGT GAGTGAAAAA
ATCTGA
 
Protein sequence
MTFLAPHRLL FGLDPERAHR LAIWALAHVP LPPPPAVDPR LSVEAFGLAF ANPLGLAAGM 
DKNGEAPGAL LRLGFGFAEI GTVTPLPQPG NPLPRLFRLT KDEAIINRFG FNSEGHARVH
ERLTAFLAGA RRKGVIGINI GANKDSLDRA EDYVKGIHAF ADCADYFTIN ISSPNTPALR
DLQQPLALDD LVVRALAARD AEAERHGRKP VLVKIAPDLT LSELDAIVKI TCARKVDGLI
VSNTTLSRPS DLQDPQAREA GGLSGKPLFD LSTRMLAETF VRVENQFPLI GVGGIDSAET
ALAKIEAGAT LVQLYSGLVF QGPGLVRRIL GELPRLLTAR NYPRLVDAVG AAAVDRVSEK
I
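
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal Python sketch, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code; the alternative start codons that table 11 permits are not modeled here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored.

    Codons containing anything other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[pos:pos + 3], "X") for pos in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First 24 bases of the gene sequence listed above.
    print(translate("ATGACGTTCT TGGCGCCGCA TCGG"))  # MTFLAPHR
```

Applied to the full gene sequence, the output would end in `*` for the terminal stop codon, which is not shown in the protein sequence.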