Gene Nmul_A0628 details


Gene Information

Locus tag: Nmul_A0628
Symbol:
ID: 3784424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 709362
End bp: 711428
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 54%
IMG OID: 637810710
Product: TonB-dependent receptor
Protein accession: YP_411327
Protein GI: 82701761
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
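
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows one way to check them against each other. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, although its numbers are consistent with both. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(709362, 711428)
print(length)                     # 2067
print(protein_length_aa(length))  # 688
```

Both results agree with the Gene Length and Protein Length fields listed for Nmul_A0628.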


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTTGT TTCGCAGGCA TCTCTTGCCA TGGATGTTTA TTTCCGTTCC TTCAGGGGCT 
CTTGCCGTAT CCAGCGATCC GGCGGAGAGA TTGCTCCCTG ATGTCGTTGT CACAGCAACC
CGCTTCAGCG AGTCCATCGA CAAACTGCCC AGCGGAGTTT CCATCGTCAA TGCGGAACAG
ATCAGAAACA GCGCCGCTAC GACTTTGCCG GAGTTGCTGC AACAACTGTC TGGCATACAC
ACGAGAAATG TCGATGGCAG CCCCGACCCA CAGATCGATA TACGCGGATT TGGTATCACC
GGCGACCAGA ACACCCTGGT TTTGCTGGAT GGTCAACGCA TGAACGAAAA TGAATTGACC
GGTATCCGCT GGTCGACGAT TCCTCTCGAT ACTATCGAAC GCATCGAAGT ATTGCACGGC
AGCGGTAGCG TCGTTTATGG AGCAGGAGCC ACCGGGGGTG TCATCAATAT CGTTACACGC
GCTGCCAAGC CGGGTCGCAT ATCGGGTACG GGGGGAGTTT CCTTCGGCAG CTACAGCGGC
ACGGAATTCA ACGGAACGTT CAACGCGGCA GGAAACAACG TTGGGCTGGC GCTCACGGCC
AATGCGTTGC GAACAGATAA CTACCGCGTG AATAACGCGT TGCGTCAATC GAATCTGCAG
GGCGATCTGC GTTACCTGTT CAGCAGCGGC AGGGCGTTGC TGAAGTTCGG ACTGGATGAT
CAGAAGCTGG AACTCCCTGC CAATCGCACT GAAGCACAGT TGCAAACCGA CCGGAAAGGC
ACCTCGACTC CTCGCGATTT CAGCACCCGC CAAGGTGCCT ACGTCACACT GAGAGGAGAG
AAAGAGACGG AATTCGGTGA TCTTGCCGCG GACCTTTCAT TTCGGGACAA CCACCGTACC
GCATTATTCG ACGATTACAA TCTGGACGGC TTCAACGCCA AAACATTCGT CAATTCCCGC
TCAGACCAGT GGCTGTTCAA CCCGAGGATG AAGCTGCCCT TTGCGGCTTT CGGCATGGAG
CACGAACTGG TACTGGGCGT GAATTTCGAA TGGTGGGATT ATCGATCGAG ACGGTTTACC
GGACCGGAAA CAGTACCAGG TTCGGCATCC GCCGATATCC CGAGAACGGA TGTAGTGGCG
ACCCAGCTAA ACCGTGCCAT TTATTTTCAA CACGCCTCCA CTTTACCTAA AGGTACCGTT
GTCAGCCTTG GAGGGCGACT GCAATGGGTA GACAACCGGG CGAATGACCG TTTCAATCCC
GCTGTCTACG CCAGCGGCCG GCAGAGCCGT ATGGTTTATG CCTATGATGC AGGTCTTCGC
CAGCCTCTGG GGGAAGCTTT TTCGGCCTAT GGCAGATTTG GGCGCAGTTT CCGGATAGCG
ACAGTGGACG AAAACTACAA CCAGTTTGGC GGTCCAATGT TCGATTCCAT AGTGAAGATT
CTCGAACCGC AAACTGCCCA TACTGGGGAG CTCGGACTGG ATTACAAGCA GAAGTCCCTA
CGTGCGCGCG CTTCGCTTTA CCGCACCAAC CTCAACAATG AAATCAGCTT CATAAACATA
GACCCATTCC TGTTTTTTGC CAACGTCAAT CTGCCGCCCA CGCGACGGCA AGGTTTCGAG
CTGGAAGGTT CATGGACGGT AACGGGCGCA CTCGATATAT TCGGCAGCTA TACGTTTATC
GATGCACGCT TTCGGAAGGG AGACTTTGGC GGTGTGGATG TATCCGGAAA TCTCGTTCCG
CTGGTGCCGC GCCACAAGGT CTCAGCGGGT GGTACATACC GGTTGGGATC CAGTACCCGT
GCGGCAGTAG TCATGAATTA TGTAGGTGAA CAGGTTTTCG ATAACGATCA AGCCAACACC
GCCACGGGTC GTATGCCGGA TTATGTCACG GTAGATCTCA AGGTGACTCA TCAGATGAGG
AAACTGCTGT TGAGCATGGC GGTGAACAAC CTGTTTGATG AAAAGTATTT TTCCTATGCC
ATTCGCAATT CAGCGGGTAC GAGCTTCAGT GCGCTACCTG CCCGGGAACG CAACGTCTGG
CTGACGGTAA AGTATCAATT CGACTGA
 
Protein sequence
MGLFRRHLLP WMFISVPSGA LAVSSDPAER LLPDVVVTAT RFSESIDKLP SGVSIVNAEQ 
IRNSAATTLP ELLQQLSGIH TRNVDGSPDP QIDIRGFGIT GDQNTLVLLD GQRMNENELT
GIRWSTIPLD TIERIEVLHG SGSVVYGAGA TGGVINIVTR AAKPGRISGT GGVSFGSYSG
TEFNGTFNAA GNNVGLALTA NALRTDNYRV NNALRQSNLQ GDLRYLFSSG RALLKFGLDD
QKLELPANRT EAQLQTDRKG TSTPRDFSTR QGAYVTLRGE KETEFGDLAA DLSFRDNHRT
ALFDDYNLDG FNAKTFVNSR SDQWLFNPRM KLPFAAFGME HELVLGVNFE WWDYRSRRFT
GPETVPGSAS ADIPRTDVVA TQLNRAIYFQ HASTLPKGTV VSLGGRLQWV DNRANDRFNP
AVYASGRQSR MVYAYDAGLR QPLGEAFSAY GRFGRSFRIA TVDENYNQFG GPMFDSIVKI
LEPQTAHTGE LGLDYKQKSL RARASLYRTN LNNEISFINI DPFLFFANVN LPPTRRQGFE
LEGSWTVTGA LDIFGSYTFI DARFRKGDFG GVDVSGNLVP LVPRHKVSAG GTYRLGSSTR
AAVVMNYVGE QVFDNDQANT ATGRMPDYVT VDLKVTHQMR KLLLSMAVNN LFDEKYFSYA
IRNSAGTSFS ALPARERNVW LTVKYQFD
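
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, assuming plain A/C/G/T input, the code below strips that layout, computes GC content, and translates the DNA in frame until the first stop codon. The codon assignments follow the NCBI compact layout (bases ordered T, C, A, G), which translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter for this gene because it begins with ATG. The helper names are illustrative and do not come from the source database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGGTTTGT TTCGCAGGCA TCTCTTGCCA"))  # MGLFRRHLLP
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.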