Gene Nmul_A0871 details


Gene Information

Locus tag: Nmul_A0871
Symbol: prpD
ID: 3784441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 989317
End bp: 990768
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 57%
IMG OID: 637810953
Product: 2-methylcitrate dehydratase
Protein accession: YP_411566
Protein GI: 82702000
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID: [TIGR02330] 2-methylcitrate dehydratase
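
The length fields in this record follow from the coordinates. The sketch below shows that arithmetic, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(989317, 990768)
print(length)                     # 1452
print(protein_length_aa(length))  # 483
```

Both results agree with the Gene Length and Protein Length values listed in the record.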


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000221374
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATCCG CGTCCTCCCA CATTCGTCCC CAGCCTGACC AGGTTCTGGT CGACATTGCA 
GATTATGTGG TGGGACGGGA TATTCAAAGC GATCTGGCTT ACGCTACCGC CCGTCACTGC
CTGATGGATT CCCTGGGGTG CGCCATGGAA GCGCTTGCGT ATCCGGCCTG CACCAAGCTG
CTTGGGCCGC TCGTGCCCGA GGCTGCCGAA TCTGAGGGCG GCGCGAGGAT TCCGGGTACC
CAGTTTCAGT TCAATCCGGT GGAAGCGGCA TTCAATATCG GCACGATGAT TCGCTGGCTC
GATTTCAACG ATACCTGGCT GGCTGCGGAA TGGGGTCATC CATCGGATAA CCTGGGGGCC
ATACTGGCTG TAGCCGACTG GCTTTCGCGC AGTGCCACGA GCGGGAGGCG GCGGCCGCTT
ATGCGCGATG TGTTGACCGC GATGATAAAG GCATACGAGA TCCAGGGTTG CTTCGCTTTG
GAAAACAGCT TCAACAGGGT GGGCCTGGAT CATGTGGTGC TGGTTAAAGT TGCCTCCACA
GCCGTCGTAA CGCATCTGCT GGGCGGCAGC CGTAAACAGA TCGTCGATGC GCTTTCCCAG
GCGTGGGTGG ACGGGCAGGC ACTTCGCACG TATCGTCATG CCCCCAACAC CGGCTCGCGC
AAATCCTGGG CTGCGGGGGA TGCCACCAGT CGTGCGGTAT GGCTTGCGCT GATCACACTC
AAGGGGGAGC CAGGCTATCC TTCAGCGCTT ACCGCGAAAA CCTGGGGGTT CTACGATGTG
CTGTTCAAGG GAGAACCTTT CAAATTCCAG AGGCCCATCG AACGATGGCA TTCCTATGTA
ATGGAGAATG TGCTGCTCAA GATTTCGTTC CCCGCCGAAT TCCACTCCCA GACAGCGGCG
GAGTGTGCGA TGCAGCTATA CCCGCACGTG AAGGACCGGA TTGCCGATAT CCGGAAAATA
ACGATCCGCA CCCACGAGGC CGCGATTCGT ATCATCGACA AGAAGGGGCC GCTCAGCAGC
CCGGCCGACC GCGACCATTG CATGCAGTAT ATCGTGGCAG TGGCCCTTAT TTTCGGCAGG
CTTACCTCTG CCGATTATGA GGATGGCGTG GCCGCCGACC CTCGCATAGA TGCGTTGCGG
GACAAGATAA TCTGTATTGA GGATCCGTGC TTCACAAAGG ATTATTACGA TCCCGAAAAA
CGCTCCATCG CAAACGGTCT CACCCTTGAA TTCAGGGACG GCAGTAAACT GGAGGAAGTG
GTGGTGGAGT ATCCCATCGG CCACAGGTTG CGTCGAAGCG AAGGCATTCC GCTACTGGAA
GAAAAATTTA GAATCAATCT TGGGCGGCGT TTCCCCGCCC AGCAGTGCGA GGCAATCATG
AATGCCTGTC ATGATCAAGG CAGGCTGGAA GCAATGCCGG TTCACGAGTT TATCGATCTG
TTTGTGATTT AG
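
A GC content figure such as the one in the Gene Information section can be recomputed from the gene sequence. The sketch below is one simple way to do it, assuming the sequence is pasted as text in the 10-base groups shown above. `gc_percent` is a hypothetical helper name, and the rounding used by the source database is not stated in this record.

```python
def gc_percent(sequence):
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other non-base characters are ignored, so the
    grouped layout used in this record can be passed in directly.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy inputs; paste the full gene sequence to check the reported value.
print(gc_percent("ATGC"))         # 50.0
print(gc_percent("GGCC AATT GC")) # 60.0
```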
 
Protein sequence
MSSASSHIRP QPDQVLVDIA DYVVGRDIQS DLAYATARHC LMDSLGCAME ALAYPACTKL 
LGPLVPEAAE SEGGARIPGT QFQFNPVEAA FNIGTMIRWL DFNDTWLAAE WGHPSDNLGA
ILAVADWLSR SATSGRRRPL MRDVLTAMIK AYEIQGCFAL ENSFNRVGLD HVVLVKVAST
AVVTHLLGGS RKQIVDALSQ AWVDGQALRT YRHAPNTGSR KSWAAGDATS RAVWLALITL
KGEPGYPSAL TAKTWGFYDV LFKGEPFKFQ RPIERWHSYV MENVLLKISF PAEFHSQTAA
ECAMQLYPHV KDRIADIRKI TIRTHEAAIR IIDKKGPLSS PADRDHCMQY IVAVALIFGR
LTSADYEDGV AADPRIDALR DKIICIEDPC FTKDYYDPEK RSIANGLTLE FRDGSKLEEV
VVEYPIGHRL RRSEGIPLLE EKFRINLGRR FPAQQCEAIM NACHDQGRLE AMPVHEFIDL
FVI
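
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds the codon table by hand instead of relying on a library. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and that the gene sequence is given in the forward, coding orientation. `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is stripped first, so the 10-base groups used in this
    record are accepted. An alternative start codon (such as GTG or TTG)
    is not converted to M here; this gene starts with ATG, so that case
    does not arise.
    """
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("ATGTCATCCG CGTCCTCCCA CATTCGTCCC"))  # MSSASSHIRP
print(translate_cds("TTTGTGATTT AG"))                     # FVI
```

Both fragments match the start and the end of the protein sequence listed above.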