Gene Nmul_A1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1874 
Symbol 
ID3786524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2160005 
End bp2161147 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content55% 
IMG OID637811960 
Producthypothetical protein 
Protein accessionYP_412561 
Protein GI82702995 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGACT ACCTGCTGAT CATTACTGCT GCCGCGATTT TGCTTGTCAC CATGGCCTAT 
CTGGTGATCC GGCTCGGTAT GCTTTCGACC ATGGTGCGCG AACTGCTGGG GCAGCAGGCA
AGGCTGATGG AAGACAAGCA TCGCGACATG CTCAAGGATT TGCATGAAGG GCTGTCGAAC
CAGGGCAACC GGCTTTCCGA GGTCCTGGGC AGGAATTCCG ATCAACTGCG GGGAACGGTG
GAAGCGCGGC TGGACCAGAT CAGCGGGAGG GTGGCTGAAC GTCTCGACGA AGGGTTCAGG
AAAACCAATG AGACTTTCAC CAGCGTCATG ACGCGGCTTG CGACGATCGA TGAAGCCCAG
AAGAAGATCG ACAGCCTGAC CACCAATATG GTGAGCCTGC AGGAACTTCT GGGCGACAAA
CGCTCGCGCG GCGCGTTTGG CGAAGTGCAA CTGGAAGCGC TGGTTCGCAA TATCCTGCCG
CCCTCTGCAT ATGAAATGCA GCATACGCTT TCCAACAGCA GCCGCGCCGA TTGTGTGCTG
AAGCTACCGC CTCCAACGGG CATGGTCGCG GTCGATTCAA AATTTCCGCT GGAAAATTTT
CATCGCATGT TCGATCGTCA TACGGATGAC ACGAGCCGTG CCCTGGCGCA GAAGCAGTTC
AAGGCGGACG TGAAAAAACA TGTGGACGAC ATTGCCGGTA AATATATCCT GCCGCCGGAA
ACCTGCGATG GAGCGGTGAT GTTCGTACCG GCGGAGGCCG TTTTCGCCGA AATCCATGCC
TATCATTCGG ACATAGTCGA TTACGCCATG CAGAAGCAGG TCTGGATAGT TTCGCCTACC
ACCCTGATGG CGGTACTGAA TACCGCGCGT GCGGTGCTCA AGGATATCGA AACGCGCGAG
CAGGTACACA TCATCAAGAA CGAACTGTCC AGGCTGGGCA AGGATTTTGC ACGCTTTGAC
GAGCGCATGA AAAAACTTGC AGACCATATC CGCCAGGCCA ATCAGGATGT GGAAGAGGTA
CATGTTTCAA GCCGGAAGAT AAGTCAACGC TTTGCCCGCA TAGAGGCCGT GGATCTCGAG
CTGCCTCAAC TGGAAATGGA AACGCCAGTG ATGCAACCGG CGGACGAAGA AAACTCCAGA
TAA
 
Protein sequence
MPDYLLIITA AAILLVTMAY LVIRLGMLST MVRELLGQQA RLMEDKHRDM LKDLHEGLSN 
QGNRLSEVLG RNSDQLRGTV EARLDQISGR VAERLDEGFR KTNETFTSVM TRLATIDEAQ
KKIDSLTTNM VSLQELLGDK RSRGAFGEVQ LEALVRNILP PSAYEMQHTL SNSSRADCVL
KLPPPTGMVA VDSKFPLENF HRMFDRHTDD TSRALAQKQF KADVKKHVDD IAGKYILPPE
TCDGAVMFVP AEAVFAEIHA YHSDIVDYAM QKQVWIVSPT TLMAVLNTAR AVLKDIETRE
QVHIIKNELS RLGKDFARFD ERMKKLADHI RQANQDVEEV HVSSRKISQR FARIEAVDLE
LPQLEMETPV MQPADEENSR