Gene Nmul_A2205 details


Gene Information

Locus tag: Nmul_A2205
Symbol:
ID: 3786230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2504798
End bp: 2505943
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 54%
IMG OID: 637812292
Product: hypothetical protein
Protein accession: YP_412889
Protein GI: 82703323
COG category: [S] Function unknown
COG ID: [COG1432] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00288] conserved hypothetical protein TIGR00288
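
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention, but the numbers agree with it) and assumes the CDS length includes the stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2504798
END_BP = 2505943
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the CDS is a whole number of codons; if has_stop is True,
    the final codon is a stop codon and encodes no residue.
    """
    codons, remainder = divmod(cds_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2505943 - 2504798 + 1 gives the listed 1146 bp, and 1146 / 3 - 1 gives the listed 381 aa.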


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCACAC TTCAGGAAAA TGTCAGCATG GCGGTATTTT GTGACTTCGA AAATGTCGCT 
CTTGGCGTGC GTGATGCGAA ATACGAGAAG TTCGATATCA AACCGGTGCT GGAGCGATTG
TTGCTCAAAG GCAGCATTGT CATAAAAAAG GCTTATTGCG ACTGGGATCG TTACAAGACC
TTTAAAACTG CGCTACACGA AGCAAATTTC GAGCTGATCG AGATTCCGCA CATTCGTCAG
TCCGGTAAAA ACTCAGCCGA CATCCGCCTG GTCGTGGATG CGCTCGATCT TTGTTACACG
AAATCGCATG TGAACACGTT CGTCATTATC AGTGGCGATT CCGATTTCTC GCCCCTCGTC
TCCAAGTTGC GCGAGAACGC TAAACAAGTA ATCGGTGTTG GAGTCAAGAA ATCAACTTCA
GATCTGCTGA TAGCCAACTG CGATGAATTC ATTTTTTATG ACGATCTCGT GCGGGAAATC
CAGCGTACTG CCGCGAAACA CGATACGGCA GAAGCACCGC CTGTGGTAAA ACAGGCGCCT
GCAGAGGAAA AACGCCCAGG AGAGGAACTC GAAGCACGCA AGGGCCAGGC AATCAACATG
GCGGTAGAGA CTTTCGACGC CCTGGTATCC GAGCGGGGCG ACAGCGGAAA AATCTGGGCA
TCGACACTCA AGCAAGCGAT CAAGCGCCGC AAACCTGGTT TTAACGAGTC TTATTATGGC
TTTCGTGCCT TCGGTAATTT GCTTGAGGAG GCGCAAGCCC GAGGCCTGCT TGAACTGGGC
CGTGACGAGA AATCGGGCAC GTATGTTTAT CGCAGCAGCA GCGCGGCCGA TAGCGGTACA
GCGGTGGAAG CGTCGGCGCC TGCTCCTGCT TCGCTGGAGA TTGCGGCCCC CTCAGCGCCG
CTCGACGGGC TCATCTCCCT GGATGAAACA CATCAGCGGA GAAAGAGTCG TAAGGCCGGG
GAAAAGAAAG CAGATTTGCT GCCAGCAACT CCTGTGCCCG GGGCAGAAAC TGCACAGACT
GCCGGCAATC CGGCAGATCA AACAGTGCCG GAAAAACCAG GAACCAAGGG AAGGAAAAAG
CCAGCCACAC GCCCTCCGCG CAAACCGGAA GCCGGGACAG ACAAGATCGA CAAGAAGCCT
GGTTAG
 
Protein sequence
MATLQENVSM AVFCDFENVA LGVRDAKYEK FDIKPVLERL LLKGSIVIKK AYCDWDRYKT 
FKTALHEANF ELIEIPHIRQ SGKNSADIRL VVDALDLCYT KSHVNTFVII SGDSDFSPLV
SKLRENAKQV IGVGVKKSTS DLLIANCDEF IFYDDLVREI QRTAAKHDTA EAPPVVKQAP
AEEKRPGEEL EARKGQAINM AVETFDALVS ERGDSGKIWA STLKQAIKRR KPGFNESYYG
FRAFGNLLEE AQARGLLELG RDEKSGTYVY RSSSAADSGT AVEASAPAPA SLEIAAPSAP
LDGLISLDET HQRRKSRKAG EKKADLLPAT PVPGAETAQT AGNPADQTVP EKPGTKGRKK
PATRPPRKPE AGTDKIDKKP G
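
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how one might do that and then compute a GC fraction or check the start and stop codons. The helper names are hypothetical, and the stop codon set is the one assumed for translation table 11 (TAA, TAG, TGA).

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons assumed for translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq is a whole number of codons, starts with ATG,
    and ends with a stop codon. Alternative start codons, which
    table 11 also permits, are deliberately not handled here."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First and last lines of the gene sequence as printed above.
    first = clean_sequence("ATGGCCACAC TTCAGGAAAA TGTCAGCATG "
                           "GCGGTATTTT GTGACTTCGA AAATGTCGCT")
    last = clean_sequence("GGTTAG")
    print(len(first), first[:3], last[-3:])
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.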