Gene Nmul_A1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1478 
Symbol 
ID3785452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1687227 
End bp1689179 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content52% 
IMG OID637811566 
ProductOuter membrane autotransporter barrel 
Protein accessionYP_412173 
Protein GI82702607 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAA GAAAAAAATC AGCTTCTCTT TCTCTTTATG CGAAATTCAT TATTGCTTTG 
TTGATGGCTC CGGTTGCGTC GTTATCCTCT CGGGCTCAGC AAGGCCTTAT GGGGGATGTA
AGCAGTATCG AGAAGGGAAT GAGCCGTAGT AACGATACCC GGAGCACAGC TTCTGCCACT
TTGCCTGATG GCATAAGTTT TCAGTCATGG CTCCCCAAAA CGCAGTTCAG CGATGCCGAA
GATACATTTC AGAATAATAC GAATAATGTG CAGGACTCCT CCCATCTGAT ATTCAATCAG
CCGGGCAGTA TCCACTATGA CGGTACGATT TCGGGGTCGG GCTTCCTGAA TAAAGCCGAC
TTGAGCATGC CGACATTATC AGGAGATTCG GGTGCATCCG GCGTGCATTC CACTGCCCGA
AGCAGCATCC TGGCGGTAAA TGGGATGGCG GACGGGATCC TGCACGTGCT GCACCATGCT
TCCCTTCACG GTACCGGACA TATAGGAGCC ACGAGGGTCG AAGGTACCGT GTCCCCGGGA
AATTCGATTG GCATGCTTAC GGTTCACGGC AATTATGTTC AGCTGAATGG TTCCACTTAT
GAAGTAGAAA TTCATCCCGA TGGCGCCAGC GACCAGGTCA TTGTCACAGA TCTTGCTGAT
ATACAAGGGG GAACAGTTTC CGTTATTCCT GCGGGAGAGG AATTCACACC TGGCAGCCGC
TTCACCATCC TGACAGCGAA TAGCGGTTTG ACTGGAAAAT TCGATTCACT CACGCATGGC
CTGATCAACC TGGGTATCTC CTATGATCCT ACTCACGTCT ACCTCGATGT TTTGCGCTTT
TGCGATATCG GCGAGACGCC TAATCAGTGC GCCACAGGAA GCGCAGCCGA GCGGTTGGGT
GGCGGCAATC CTATCTATAG AGCAATTGTC CATCAGCCAG ATCAGGAGAG CGTGCGCCAG
GCTTTTAACA GTCTTTCCGG CGAAGGGCAC GCCAGCATTC AGGGTATTAT CATTGAAGAC
AGCCGGTTTA TACGGGAATC GGTTTCAGAC CGGGTCAGGC AAGCTTTTCA TCTGGTGGGT
GCGCAGTCCT CTGATACCCC AGGACATATC CTTCAACAGA ACTCTGCTGC TGATGGGGCA
TTATGGGGAC GTGTCCTGGG TTCGTTCGGC CACCGGGATG GAGGCCTTAA TGCTGCCCGC
ATCGGGCGTG CGCTTGCTGG AATTTTTGTG GGCGGAGATA TGCGGATTGC TGATAAATTT
CTTCTCGGCG TTGCCGGAGG ATACACGCAG GGATCCTATG AAGGAGCGCG TCTTTTCACC
GCTTCAAGCG ACAATTATCA CGTTTCCGTT TATGGTGGCG GACAATGGGG ACCTCTTGGC
TTACGCGCGG GTTCCGCATA CATCTGGCAC GATCTGGAGA CAGGGCGGGA CGTGATCTTT
CCAGGTTTTT CAAATCACCT CAATGCCGAA TACAATGCGC GCGGAATACA GGTGTTCGGC
GAGGTTGGTT ATGGTTTGCC CCTGAACTTC ATCTCACTCG AACCATTTGC ACGGCTCGCT
TACATAAACC TGCGTACAAA AGGTTTTCAG GAGCGTGGAG GGATATCAAG TCTGCGCAGC
GGTAGCAGTC AGCAGGATAC CGCATATACT ACTCTGGGAA TCAATGTGGC GAAGACGCTA
TCCCGACTGG AGAAGATAGT TACCACGCTT CGCGGAAGCA TTGGATGGCG GCATGCTATG
GGTGAGATGA CGCCGGTTTC GACTTTCGCT TTTGCTAATG GCTCATCCTT TGCTACAACC
GGTGTGCCTA TTGCCCGCAA CGGAATAGTT CTCACCGGAG GCGTGGATGC TCATGTTTTT
GGAACTGCAA CCTTGGGCAT TTATTATCAG GGACAAATTC TTCACAACAT TGCCGACCAT
GGTGTTCGGG CAAACCTCAG CTGGAAATTC TGA
 
Protein sequence
MAKRKKSASL SLYAKFIIAL LMAPVASLSS RAQQGLMGDV SSIEKGMSRS NDTRSTASAT 
LPDGISFQSW LPKTQFSDAE DTFQNNTNNV QDSSHLIFNQ PGSIHYDGTI SGSGFLNKAD
LSMPTLSGDS GASGVHSTAR SSILAVNGMA DGILHVLHHA SLHGTGHIGA TRVEGTVSPG
NSIGMLTVHG NYVQLNGSTY EVEIHPDGAS DQVIVTDLAD IQGGTVSVIP AGEEFTPGSR
FTILTANSGL TGKFDSLTHG LINLGISYDP THVYLDVLRF CDIGETPNQC ATGSAAERLG
GGNPIYRAIV HQPDQESVRQ AFNSLSGEGH ASIQGIIIED SRFIRESVSD RVRQAFHLVG
AQSSDTPGHI LQQNSAADGA LWGRVLGSFG HRDGGLNAAR IGRALAGIFV GGDMRIADKF
LLGVAGGYTQ GSYEGARLFT ASSDNYHVSV YGGGQWGPLG LRAGSAYIWH DLETGRDVIF
PGFSNHLNAE YNARGIQVFG EVGYGLPLNF ISLEPFARLA YINLRTKGFQ ERGGISSLRS
GSSQQDTAYT TLGINVAKTL SRLEKIVTTL RGSIGWRHAM GEMTPVSTFA FANGSSFATT
GVPIARNGIV LTGGVDAHVF GTATLGIYYQ GQILHNIADH GVRANLSWKF