Gene Nmul_A2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2378 
Symbol 
ID3784969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2705002 
End bp2706348 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content60% 
IMG OID637812467 
Producthypothetical protein 
Protein accessionYP_413059 
Protein GI82703493 
COG category[S] Function unknown 
COG ID[COG1426] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.654277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCTC CTGAAAGTAG CGACATCGAG CAGCATGAGG AACCAAAAAA AACCGGAGAG 
CAGGTCGGCC AGGTGCTGCG CGCTGCGCGG CTGGAACGCG GCCTGGATAT TGAGGACGTT
GCGCGCCAGT TACGCTTTGC AGCCCGGCAG GTGACGGCAC TCGAAGAGGA TGAATACGAT
AAGCTTGCGG GCGGCCCCTT CCTGCGCGGA TTCGTGCGCA ACTATGCCAA ATTGCTGCAA
CTGGATGAAG CGCCGTTATT GAAGTTGCTT GAACAGTCGG TTCCTCCTCC GACGACGCAC
GTGGGGCGGC CTCCAAGTGA GGAGATTCCC TTTCCCTCCG GGCAGGAGTA TCTGAAACGC
AACGTGGTCC TCGGCGGAGG AATTGTCCTG GCCATCACTC TGCTGGGGTA CGCGATCTAT
AGTGGTGACA AGGCTTCCGT TGCCAATCAG CCCGACATGG CAATGGAATC GGAGAAAGAC
ACCGGGCAAC CGACTCTTTC ATTCCCGTTT CCATCGCAGG CGCCCCCAGC GGAAGTGCCG
GAATCTCAGG CTCCCGCACC TTCCGCGCTT GTTCCCGATA TGGCTTCCCA GCGAGAGCCG
GGTATTGCCG CCCTCGAGGA GACAGCACCT TCCGCGGATA CAGGCAGAGA GCAGGACACC
GTGGCGTCCG CTCCTAAAGA CGCGGCCGCT GGGAGCGGTG CCGAAGCTTC CGCTGGAGAG
CCTCTCACCC TGCCTCTTGC GCCGCCACCG GCGGTCCAGA CCGCTCCCAC AGCACCGGCA
GTTCCAGGGG GGGAGCCCGC CAATGTCCCA GTGGCTCCGA AGCCGCCCCA GGTTACAGTT
GCTCCAAAGC CATCTGAGGG CCCAGTTACT TCGAAGCCAT CTGGGAGCGC AGTTACCCCG
AAGCCGCCTC AGGTCGCAGT TGCTCCGAAG CCACTTGAGG GCCCAGTTAC TTCGCAGCCA
TCTGAGAGCG CAGTTACTTC GAAGCCGCCT CAGGTCGCAG TTGCTCCGAA GCCATCTGAG
GGCGCAGCTA CTTCGAAGCC ATCTGAGAGC GCAGTTACCC CGAAGCCGCC CCAGGTCGCC
GCTACTCCGA AGCCACCTCC TGCAAAAGCT GGAATTCGTC TGACGTTTGC CGGCGAGTCA
TGGGTGCAAG TCAAGGATGG CAACGGGAAA TTGCTTCTCT CAAGAGTCAA TCCTCCTGGC
AGCGAGCAGG TGCTGCGTGG TAAGCCGCCC TATTCTCTCA TGATCGGAAA CCCGGGGCAG
GTCAAGCTTG TCTACAATAG CAAACCAGTC GATCTCTCGA TTTTCGCCAA ACTTCCCGGT
GGAATGGCAC ATCTCGTGCT CCAATAG
 
Protein sequence
MEAPESSDIE QHEEPKKTGE QVGQVLRAAR LERGLDIEDV ARQLRFAARQ VTALEEDEYD 
KLAGGPFLRG FVRNYAKLLQ LDEAPLLKLL EQSVPPPTTH VGRPPSEEIP FPSGQEYLKR
NVVLGGGIVL AITLLGYAIY SGDKASVANQ PDMAMESEKD TGQPTLSFPF PSQAPPAEVP
ESQAPAPSAL VPDMASQREP GIAALEETAP SADTGREQDT VASAPKDAAA GSGAEASAGE
PLTLPLAPPP AVQTAPTAPA VPGGEPANVP VAPKPPQVTV APKPSEGPVT SKPSGSAVTP
KPPQVAVAPK PLEGPVTSQP SESAVTSKPP QVAVAPKPSE GAATSKPSES AVTPKPPQVA
ATPKPPPAKA GIRLTFAGES WVQVKDGNGK LLLSRVNPPG SEQVLRGKPP YSLMIGNPGQ
VKLVYNSKPV DLSIFAKLPG GMAHLVLQ