Gene Nmul_A0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0438 
Symbol 
ID3785906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp486326 
End bp487654 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content54% 
IMG OID637810514 
Producthypothetical protein 
Protein accessionYP_411138 
Protein GI82701572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTGCA GGCGAAAGCG CAGCAGCCGG ATTCCGTACG GAGTAGGTTG TTTTTTATTG 
ATCTTCATGA GCAACAGCAT TGTTCAGGCG GTTGACATAG CAGGGTGGTT GCGGGAGAAC
GGTGTAAGGG CGGGCGGGTG GATCAATGCG GGGGCAACCT ATAACCCCGG TAATCCCGTC
AACGGATACA ATGGCCCCGT TACCTTTGCC GACCGCGCAA ACCGGTTTCA GTTGAATCAG
TTCAATCTAT ATCTCCAGCG CACGGTAGTC ACGGAAGGCG AGGCTTTTGA TCTCGGGGGG
CGGGTCGATT TCATGTTCGG CACCGATGCG ATCTATACCC AGGCCTTCGG CGTTCCGCCA
TTCGATGTGA ATACCGGTCA AGTCTTGAAT AGGGGCCACT GGGATCTGAA TATATGCTGC
TCATCGACCC GGACGTATGG TATCGCGCTA CCCCAAGCGT ATCTCGAGGC CTATATACCG
TTGGGTAATG GCTTGAACAT CAAGGCAGGT CATTTTTATT CGCCGACAGG GTATGAAACC
GTTCCTGCGC CCGACAATTT TTTTTACACT CACGCCTATA CCTTTAACAA TGGGGAGCCA
TTCACCCACA CGGGTGCCGT AGGGAACTAT ACCGTCAACG GGAACTGGTC GGTGATGGGC
GGCCTCATCA CCGGGAGCGC AACCGGCGAT TGGGATGGCG GATTCGACAG AGAGCTGGGA
AACTGGGGAG GGCTGGGCGG TATTACCTGG GTCAGCGACG ACAAAAGGAC TTCGGCCAAC
ATCACGGGGA GCTACAGTGC GACGTCCACC CGCAGCAACA GGCCATGGGG AATGTACAAC
ATTGTGCTGC AACACAGGAT CACACCGAAA ACCCATCTGG TTCTACACCA CGTTCACGGC
TATGCCGGCG GCGTTTTGCT GGGAGGGGTG CCAAAGAATG TCGAATGGTA TGGCATCAAT
ACCCACCTCT ATTATGACTT GTTGGAGGAT CTCTCGGTGG GTATTCGCGG CGAATGGTTC
CGGGACCGGG ATGGTTTTCG CGTCTCTTCG CCCTTTCGCG TGGTAGCGGC GCTCAACCAT
ACCGGAATCA GTTTTGCAGG CGATTCTTCC ACAGTGAGGG CGGCCCCTGC GGACTACTAT
GAAATCACCT TCGGAATGAA CTGGAAGCCA GCGAAAAGGT TGCGGCTGGA CTGGAAGCCG
ATGCAAAAGC TGAACGTTCG TCCAAATATC CGCTATGACC GTGCGGACGG CATTGACCAT
TCGCATCGGC CTTTCAACAA CAAGAAGGAT CAGATTATAT TTTCTCTCGA TGCGATGATT
CCATTCTGA
 
Protein sequence
MKCRRKRSSR IPYGVGCFLL IFMSNSIVQA VDIAGWLREN GVRAGGWINA GATYNPGNPV 
NGYNGPVTFA DRANRFQLNQ FNLYLQRTVV TEGEAFDLGG RVDFMFGTDA IYTQAFGVPP
FDVNTGQVLN RGHWDLNICC SSTRTYGIAL PQAYLEAYIP LGNGLNIKAG HFYSPTGYET
VPAPDNFFYT HAYTFNNGEP FTHTGAVGNY TVNGNWSVMG GLITGSATGD WDGGFDRELG
NWGGLGGITW VSDDKRTSAN ITGSYSATST RSNRPWGMYN IVLQHRITPK THLVLHHVHG
YAGGVLLGGV PKNVEWYGIN THLYYDLLED LSVGIRGEWF RDRDGFRVSS PFRVVAALNH
TGISFAGDSS TVRAAPADYY EITFGMNWKP AKRLRLDWKP MQKLNVRPNI RYDRADGIDH
SHRPFNNKKD QIIFSLDAMI PF