Gene Nmul_A1677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1677 
Symbol 
ID3785664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1918218 
End bp1919084 
Gene Length867 bp 
Protein Length288 aa 
Translation table11 
GC content57% 
IMG OID637811763 
Producthypothetical protein 
Protein accessionYP_412367 
Protein GI82702801 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain
[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTA CTCCTAGCTC TAAAATCCAC AGCCTGATTC TGGGGGCAGC CCTGAGTATC 
GCACCTGGTT TTGTTTCCCC TGTCTCCGCA CAGGAACGTT CATATATCCT CAACTTCTAT
GACAACAGTC TAACCGATCT CGGGACGCTG GATCTGGGTG GAGGTTCCAG TTACGCCCGT
GGCATCAATG ATACCGGGCA GGTGATGGGG GAGTCCCTTC TTCTAGGCGA CCCGAATAAT
GCGCACGCTT TTATCACCGG TCCCAATGGT GTGGGCATGA CCGATCTCGG GACGCTAGGG
GGAATGTGGA GTACTGCCAA CGACATCAAT AATGCTGGGC AGGTGGTGGG GAGCGCAGGC
ACGGCCGCAG GTGAGCGTCA CGCTTTTATC ACCGGCCCCA ATGGCGAGGG CATGACCGAT
CTCGGGACGC TGGGGGGAAA TTACAGTACC GCCAACGACA TCAATAATGC TGGGCAGGTG
GTGGGGTGGT CCACCACGGC CTCAGGTTCC GAGCACGCTT TCATCACTGG TCCTGATGGC
GTGGGCATGA CAGATCTCGG GACGCTGGGG GGAAATTACA GTACCGCCAA CGACATCAAT
AATGCTGGGC AGGTGGTGGG GAACTCCGCC ACAGCCGCAG GTGAGGGACA CGCTTTTATC
ACCGGCCCCA ATGGCATGGG CATGACAGAC CTCAATTCGC TGGTTGAGTG GCCAGCCGGA
ATTGCTCTAG CGAACGCTGT CGACATCAAT AACGTGGGAC AGGTCCTCGT CAATGCTGCG
ATCCCTGAGC CTCAATCCTA TGCTTTGATG CTCGCGGGCC TCATGCTGGT CGGATTCATG
GTTCGGCGAA AAAGCCTGCC GGCATAA
 
Protein sequence
MKTTPSSKIH SLILGAALSI APGFVSPVSA QERSYILNFY DNSLTDLGTL DLGGGSSYAR 
GINDTGQVMG ESLLLGDPNN AHAFITGPNG VGMTDLGTLG GMWSTANDIN NAGQVVGSAG
TAAGERHAFI TGPNGEGMTD LGTLGGNYST ANDINNAGQV VGWSTTASGS EHAFITGPDG
VGMTDLGTLG GNYSTANDIN NAGQVVGNSA TAAGEGHAFI TGPNGMGMTD LNSLVEWPAG
IALANAVDIN NVGQVLVNAA IPEPQSYALM LAGLMLVGFM VRRKSLPA