Gene Nmul_A1826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1826 
Symbol 
ID3784921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2086018 
End bp2088372 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content53% 
IMG OID637811913 
ProductTonB-dependent siderophore receptor 
Protein accessionYP_412515 
Protein GI82702949 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCT ATACAAAAAC TATTAAAGGG CTCTGTCCAC TTACACGACC CACCAAACTT 
GTAATCGCAA TACAGTACGC ACTCCTCAGC CTGGCCGTCG CTGGCCACGT GCATGCACAG
GACGCCTCCC CGGAATCAGC CAGAAAATCC CAAATGAATG TTCTTCCGGC AGGAGGTAAT
GCCGAAACCT CTGCCACAGA CTCTTCCGGA TCCGGAGACA AAGCTTCAAA GGGGAACGCC
CCATCGACTC AGGAGAATGC GCAGCAGGGG AAAAATCCAG AATCAGTCAA ACCTTCCGAA
ACGGTTCTGC CAACCCTGGT TATCCGGGAA AAAGGAGTCC AAGAGGGATA CGGCACTAAA
AGCAGTATAG TGGGAACCAA GACTAGGACT CCAATTACAG AGATACCGCA ATCCATGTCC
GTTATCAGCC GAAATGAATT GGATATGCGT GCCGTAAACC TCAATTTCAC TGAAGCACTA
CGATATATAC CGGGTGTGGT ACCGGACCAG TTTGGTTTCA ACGGAACGGG TTTCGAGTAT
GTAAGCATGC GAGGCTTCAA CTCCCTGAGT ACTGCCAACT TCCGGGATAA TTTAAGCCAG
CAGGGGAGAG GGCTTTATTT TGCCGACTTC ATCACTGACC CCTATTCTCT CGAGCGGGTA
GAGGTATTGC GCGGTCCCAC CTCGGTCATT TTCGGGCGAG GCGATGCGGG AGGTATCATA
AACCGTGTTA CGAAGCTGCC CACCTCCACT CCCATCCGCG AGGTCGAGCT CCAGTACGGC
AGTTTCGACC GCAAGCGGAT CGCCGGGGAT TTTGGACTGG CCAATGAAGA TGGAACACTG
ATGTTCCGGC TGGTCACGAC CGCGCTCGAT ACGGACACGC AGGTGCGCTT TCCGAATACG
GGCGGAGATC GGGCCCAGAT CAGGCGCTTT TACATCTCGC CGTCGCTGAC TTGGCGCCCC
ACCGACCGGA CATCCGTCAC ACTTTTCGGC GATATCCTGA ACAACCGCAG CGAAGCGTCT
GCCTTTTACG TAGCAACGCA AGGGGGCAGC CCCACTAATA CGTTGCTGGG CGAACCCACC
TTCACGCGGT ATTCGACCGA CCAGGCTTCC TTCAGCTACA AACTCGAGCA CCATTTCAAC
GACACCTTCA CGGTACGGCA AAACTTCCGT TTCATGGGAC TCGATGGCAG GTTCCGAGAC
CTTAACCCCG CAGGATTTGA CGCGGATGGA CGGACCTTGT TTCGCAGTGC ATTGAGTACG
CGGGAGCGGG TGAATCAAAC GGTGCTGGAT ACACATGTGG AAGCTCGTAC CCGGACAGGC
CCCCTCAATC ATACCGTACT GGCCGGAATC GACTGGAATC GCGTCGAGTC CACCCTTAAA
TCTTTTACGG GCAGCGCTCC CTCCATCGAT ATTTTCAATC CCGTATATTT CCAACCGGTG
CCCACCCCGG ATTCCCCGTT CATTGATGGC AACCAAAAGA TAGACCAGGT TGGATTTTAC
GTACAGGATC AAATCAAGCT CAACCAATGG CTTTTGACTC TGAGCGGCCG CCACGACAGG
GTGTCGAACG TTACCAATGT TAATGTTTTC GATCCGCAGC ATACCGCTAG CAAGGATTCA
GCATATACCG GCAGGGCGGG GCTGACCTAC CTTTTTTCCA ACGGCATCGC GCCTTATTTC
AGTTATTCGC AGTCGTTTCT GCCTCAATAC GGAATCGATT TTGGTAATAA CAGCCCCTTC
AAACCTGCGC GCGCTTCCCA GTATGAAGTG GGCATCAAGT ATCAACCCCC GGGCACCAGG
AATTTGTTTA CGGCAGCATT GTTCGAATTG ACCAAAACCA ACGTTTTAAT TCCTGATCCC
CGTACACCGC TTGGCGTGTC GCAAGCAGGC GAGATACGCT CCCGGGGAGC GGAATTGGAA
GCAAGGACAG AGGTTTTTCG AGGATTGAAT GCGATTGGCG CTTTCAGCTA TGTCGACGTC
AAGGTAACCG AAAGCGCCAG TGGTTTTACA GGTAACATGC CTGTGCGGGT ACCCAACCTG
ACGACCTCAG GCTGGCTCGA CTATAACCTC GGCACATTGA ATGTCGACTG GTTGAAGGGG
TTCTTCATCG GGGGCGGCGT GCGCTATGTG GGCAGAGTGT TCAATGATGA GGCGAATACC
AGCACCACAC CTTCCTTTAC CCTGTTCGAT GCGGTCCTGC GATATGATCA CGGCCCCTGG
CAATTTCTCA TCAATGCCAA CAACATATTC GATGAGAAAT ACTATACCGC CAATAATGTC
GTCCCGGGTT CCGGCGGACA GTTCTTCCTG GGGACGCGAC GTACTGTGAT CGGGACGTTG
AAACTCAGGT TTTAA
 
Protein sequence
MKAYTKTIKG LCPLTRPTKL VIAIQYALLS LAVAGHVHAQ DASPESARKS QMNVLPAGGN 
AETSATDSSG SGDKASKGNA PSTQENAQQG KNPESVKPSE TVLPTLVIRE KGVQEGYGTK
SSIVGTKTRT PITEIPQSMS VISRNELDMR AVNLNFTEAL RYIPGVVPDQ FGFNGTGFEY
VSMRGFNSLS TANFRDNLSQ QGRGLYFADF ITDPYSLERV EVLRGPTSVI FGRGDAGGII
NRVTKLPTST PIREVELQYG SFDRKRIAGD FGLANEDGTL MFRLVTTALD TDTQVRFPNT
GGDRAQIRRF YISPSLTWRP TDRTSVTLFG DILNNRSEAS AFYVATQGGS PTNTLLGEPT
FTRYSTDQAS FSYKLEHHFN DTFTVRQNFR FMGLDGRFRD LNPAGFDADG RTLFRSALST
RERVNQTVLD THVEARTRTG PLNHTVLAGI DWNRVESTLK SFTGSAPSID IFNPVYFQPV
PTPDSPFIDG NQKIDQVGFY VQDQIKLNQW LLTLSGRHDR VSNVTNVNVF DPQHTASKDS
AYTGRAGLTY LFSNGIAPYF SYSQSFLPQY GIDFGNNSPF KPARASQYEV GIKYQPPGTR
NLFTAALFEL TKTNVLIPDP RTPLGVSQAG EIRSRGAELE ARTEVFRGLN AIGAFSYVDV
KVTESASGFT GNMPVRVPNL TTSGWLDYNL GTLNVDWLKG FFIGGGVRYV GRVFNDEANT
STTPSFTLFD AVLRYDHGPW QFLINANNIF DEKYYTANNV VPGSGGQFFL GTRRTVIGTL
KLRF