Gene Nmul_A0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0140 
Symbol 
ID3784112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp145691 
End bp146890 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content55% 
IMG OID637810211 
Producthypothetical protein 
Protein accessionYP_410841 
Protein GI82701275 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGATTG CAGGCCAGCC AAAAAAAATT ACTGCATTGA CACGTGTGCG AGACACGTTA 
GCGATACCTC TCGCTGGAAT TGCCACATTT TTCTATGTTC TATGGCAATT GGTGACGGAG
CAGCAGCTTT TTCTGCGTAC CTTTCTTGTG GAGCGCATGG GCAAGCGTGC TGTGAAGAAA
CTGCCGCTGG GATATTGGAA TGTGCTGGCG CTCAAAAGCA TCATGTTGTG CCAGCATTTG
TCTCATAACC CGAACCTGCT GCGAGATGCC CGCAAGCGTA CCGAATCGCG CATCCTGCAA
CGGTTGCAGG ATCGGGGCGT GCCGCGCGGG CAAGTTCTCC CCATCACGGA ATATCAACCC
GTCCAGATAG AGCCCCATCG GTTTTACCGG GAGCATGTGA AGCGCGGCGT ACCGTGCATT
ATGCGCGGGT TCGTGGGTAA CGCGCCGATC GACTGGACGC TCGACAAACT TGCGGAACGT
TTTCCGGATA CCATCGTCCA GGCGCTGGAC AAGCGGAGCA AGAAGATGGT GAATGTCTCT
CTGCGCGAGA TTGCGGAGGA CCGCCGCTGC AATTACATTC CGCAGCAATT GTTGCTCGAT
CAAAATCCGA CTTTCTACGA ATATTTCCGC ATTCCGCGCT CGCATGCGTA TTTCCCCGTC
ATGGGACGGC CCTCGAAGCC GGTTCTGAGT TTTCTGATCT TAGGCCTGGG AGCGGGATTG
AACGCCAACT ACCACTGTGA GGAAGGGCCC AACTGGTATC TTGCCGTTTC CGGTTCCAAG
CGCTGGACCT TGATCGAATC CGAATATTCC TGGTTGCTAT ACCCGGCAGC GCTTGGCAAC
GGCATGCGCC GGTTTGCGGA GTTCATCGCG GATAAGGAAG GGGAACCGAG CGACCGGGAT
GCGTATCCCC TGGTGGAGTA TGCGCCGCGT TACGAGTTCG AGCTTCATCC CGGCGACGTT
CTGTTTTTCC CTGCCTGGAT GTGGCACAAG ACGATCAACC TCAATGAAGA AGGCCTGGGG
GTCACCTGCC GTTACACTGC CCCGACCGAA ATCTCCAACA GATATTTCCG GGCCCTGCAA
CTGCTCTCCG GAGGGTTCTG GAAAAGCTGC GTGGAAGTCA TTAGTTGCGG CATACGGGGT
AATATCGCCT CTCTCGCCAG TGATACCGAC CACAACGAGC AGGAAACAGT GTTGTACTGA
 
Protein sequence
MEIAGQPKKI TALTRVRDTL AIPLAGIATF FYVLWQLVTE QQLFLRTFLV ERMGKRAVKK 
LPLGYWNVLA LKSIMLCQHL SHNPNLLRDA RKRTESRILQ RLQDRGVPRG QVLPITEYQP
VQIEPHRFYR EHVKRGVPCI MRGFVGNAPI DWTLDKLAER FPDTIVQALD KRSKKMVNVS
LREIAEDRRC NYIPQQLLLD QNPTFYEYFR IPRSHAYFPV MGRPSKPVLS FLILGLGAGL
NANYHCEEGP NWYLAVSGSK RWTLIESEYS WLLYPAALGN GMRRFAEFIA DKEGEPSDRD
AYPLVEYAPR YEFELHPGDV LFFPAWMWHK TINLNEEGLG VTCRYTAPTE ISNRYFRALQ
LLSGGFWKSC VEVISCGIRG NIASLASDTD HNEQETVLY