Gene Nmul_A1511 details

Gene Information

Locus tag: Nmul_A1511
Symbol:
ID: 3786097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1728258
End bp: 1729556
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 54%
IMG OID: 637811599
Product: hypothetical protein
Protein accession: YP_412206
Protein GI: 82702640
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03440] conserved hypothetical protein TIGR03440
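
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the reported 1299 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1728258, 1729556)
print(length)                     # 1299
print(protein_length_aa(length))  # 432
```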


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.92317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACGC CGTTGTACGT AAAATATTCC TCGATCCGCG ACCGGACTTT GAGTCTGATC 
GAGCCGCTGA TCGACGAGGA CTGCTGTGTC CAGTCCATGC CCGGGACAAG CCCGGTAAAG
TGGCACCTGG GGCATGCTTC CTGGTTTTTT GAGAAGTTTG TATTGCAGCA CTATGAAAAG
CCCTTCACCC CTTTTCACCC GGCTTTTCTC ATGATGTTCA GCTCGTACAA TAAATCAGGT
CAGACCCATC CTGATCCGAA ACGCGGGCTG TTCACCCGCC CATCCTTGTC CTTGACGCGA
GAATACCGCA ATAACGTCAA CGAACGTATG GAGCAGGTCC TGAAACGTTC GGAAGAAGAC
GAAATGCTGC GCATGCTCGC GGTGCTGGGA ATGCATCATG AGCAGCAGCA CCAGGAACTC
ATGCTGGCCG ACGTCAAACA TCTGCTTTCC CAAAGCCCTT TGAACCCTTC CTATAATAGC
CAGCCGCTCC TTGATTCCCC TGTCCCCCCG CCCCTCGAAT GGTGTCGTTT TGACGGGGGT
CTCGTTGAAA TAGGCTATAA GGGAGACGAG TTCAGCTACG ACAATGAATC GCCCCGTCAC
AAGCAATACC TGCAACCCTA TCAGCTCGCA TCCCGGCTTG TCACCAATCG CGAATACCTC
GAATTCATGA AAGCCGGCGG ATATGACAAT CCCGGCGTGT GGCTTTCCGA AGGATGGGAC
TGGATGAAAG CCAACCGCCG GTCGCATCCT CTTTACTGGC GGGAAAGCGA TCAGGGATGG
GAGGAATTCA CTCTCAGCGG TGCCATGCCA CTGGACCTGA ATCTGCCGGT CATTCATGTG
TCCTTCTACG AAGCCGATGC CTTCGGACGA TGGGCGGGCG CCAGACTCCC CACGGAAGCT
GAATGGGAAA ATGCTGCTTC TCAACAGGAA ATAGAAGGTT GCTTCGCTGA TAACAACCGT
TTTCATCCCT CCTCCGCAGG CGGCTCTACG CCATCTGCAA ATACCGGGGG TCTTGCTCAA
CTCTATGGTG ATGCATGGGA ATGGACGCAG TCGAGCTACT CCCCTTACCC GGGTTACAAT
CCTGCAAAAC CCAACGAGAA CGAACCGATG TCCTTTGTCT GGGATGAGGC GGTAGGCGAA
TATAACAGCC GGTCTATGGT GAACCAGTAT GTGTTGCGCG GCGGGTCATG CGCAATTCCA
AAAGAGCGGA TACGGGCAAG TTTTCGTAAT TTCTTCCCCG CGGATACATG CTGGCAGTTT
TCCGGAATTC GTCTTGCAAG AGACTTGAGA GATTCTTAA
 
Protein sequence
MTTPLYVKYS SIRDRTLSLI EPLIDEDCCV QSMPGTSPVK WHLGHASWFF EKFVLQHYEK 
PFTPFHPAFL MMFSSYNKSG QTHPDPKRGL FTRPSLSLTR EYRNNVNERM EQVLKRSEED
EMLRMLAVLG MHHEQQHQEL MLADVKHLLS QSPLNPSYNS QPLLDSPVPP PLEWCRFDGG
LVEIGYKGDE FSYDNESPRH KQYLQPYQLA SRLVTNREYL EFMKAGGYDN PGVWLSEGWD
WMKANRRSHP LYWRESDQGW EEFTLSGAMP LDLNLPVIHV SFYEADAFGR WAGARLPTEA
EWENAASQQE IEGCFADNNR FHPSSAGGST PSANTGGLAQ LYGDAWEWTQ SSYSPYPGYN
PAKPNENEPM SFVWDEAVGE YNSRSMVNQY VLRGGSCAIP KERIRASFRN FFPADTCWQF
SGIRLARDLR DS
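
The protein sequence is the translation of the gene sequence, and the relationship can be spot-checked on short stretches. The following is a minimal sketch, not the database's own pipeline. It builds a codon table from the standard amino-acid assignments, which translation table 11 is understood to share (table 11 differs mainly in which start codons it allows). The `translate` helper is hypothetical; it strips the 10-base block spacing used above and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGACAACGC CGTTGTACGT AAAATATTCC"))  # MTTPLYVKYS
# Last three codons of the gene sequence, ending in the TAA stop.
print(translate("GATTCTTAA"))  # DS
```

Both outputs match the start and end of the protein sequence listed above.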