Gene Nmul_A0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0354 
Symbol 
ID3784546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp383760 
End bp384803 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content58% 
IMG OID637810430 
Productdihydroorotase 
Protein accessionYP_411054 
Protein GI82701488 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATACCT TCACACTTAC ACGCCCGGAC GACTTCCACC TGCATCTCCG CGATGGCGAA 
CACATGCGGG CAGTGCTGGC GGACACTGCC CGCCGTTTTG CCCGCGCAAT CGTCATGCCC
AACCTCAAGC CTCCCGTCAT CACCACGGAA ATGGCGGTGG CGTATCGCGG GCGGCTTCTT
GCCGCCCTGC CGCAGGGCGT GCGCTTCGAG CCGCTGATGA CCCTCTACCT GACCGATAAC
ACCTCCCCTT CGGAAATCAT CGAAGCAAAA AGAAGCGGAG TAATCTACGG CATCAAATAC
TATCCCGCAG GCGCTACCAC AAACTCCGCG GCAGGAGTCA CCGACATTGC CAGATGCCAT
GAAACTCTGG AAGCCATGGA GCAAGCGGAA ATGCCCATGC TGGTGCATGG CGAGGTAACC
GATCCGGAGG TGGATGTATT CGACAGGGAA AAGGTCTTTC TTGAACGGAT GCTGATCCCG
TTGACCCAGC GTTTTCCGCG ATTGCGCGTG GTATTCGAGC ATATCACCAC ACGGGAAGCG
GTAGAGTTCG TCATCAACGC GCCGAAAACC GTCGCCGCCA CCATCACGGC CCACCACCTC
CTGATCAGCC GGAATGCCCT TTTTCAAGGG GGTATTCGGC CTCACCACTA TTGCCTGCCC
ATCCTCAAGC GGGAAACCCA CCGGCAGAAG CTGATCGAAG CGGCCACCAG CGGCAATCCG
AAATTTTTCC TCGGCACCGA TAGCGCTCCC CACGCACAAT TCGCCAAGGA AAACGCCTGC
GGCTGCGCCG GCATCTATAC TGCTCACGCA GCGATCGAGC TGTATGCGGA AGCATTCGAA
CAGGCGGGCG CGCTGGAAAA ACTGGAAGCC TTTGCCAGTT TTCATGGTGC GGATTTCTAT
CAGCTACCGC GCAATCAGGA CAAGATTACG TTGAAGAAGG AAAATTGGAG GGTGCCGGCG
CAGCTGGAAT TTGGCGGCGA AAGCCTGATT CCATTCCGGG CGGGGGAAAA CGTTACCTGG
GCCTTGGATA AAACCGGGAA TTAG
 
Protein sequence
MNTFTLTRPD DFHLHLRDGE HMRAVLADTA RRFARAIVMP NLKPPVITTE MAVAYRGRLL 
AALPQGVRFE PLMTLYLTDN TSPSEIIEAK RSGVIYGIKY YPAGATTNSA AGVTDIARCH
ETLEAMEQAE MPMLVHGEVT DPEVDVFDRE KVFLERMLIP LTQRFPRLRV VFEHITTREA
VEFVINAPKT VAATITAHHL LISRNALFQG GIRPHHYCLP ILKRETHRQK LIEAATSGNP
KFFLGTDSAP HAQFAKENAC GCAGIYTAHA AIELYAEAFE QAGALEKLEA FASFHGADFY
QLPRNQDKIT LKKENWRVPA QLEFGGESLI PFRAGENVTW ALDKTGN