Gene Nmul_A0243 details


Gene Information

Locus tag: Nmul_A0243
Symbol:
ID: 3785729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 260176
End bp: 261567
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 51%
IMG OID: 637810318
Product: lipopolysaccharide biosynthesis
Protein accession: YP_410943
Protein GI: 82701377
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03017] chain length determinant protein EpsF
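
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 260176
END_BP = 261567
GENE_LENGTH_BP = 1392
PROTEIN_LENGTH_AA = 463


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))          # 1392
print(coding_protein_length(GENE_LENGTH_BP))  # 463
```

Under these assumptions, 261567 - 260176 + 1 gives 1392 bp, and 1392 / 3 gives 464 codons, one of which is the stop codon, leaving 463 amino acids.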


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACTTC AACAATTTTT ACTCATCCTT CGTGCCCGGT ATAAGGTAGT CCTCTATATT 
TTATTGTTTA CGGTAATCGC TACCCTTGTA GTCAGTTTGC TATTGCCTAA ACAATATACC
GCTGGTACAG CTGTAGTGGT TGACATTAAA TCGCCCGATC CGGTAGCGGG GATGGTATTG
CCCGGTTTGA CGTCACCTAC TTACATGGCT ACCCAGGTCG ACGTGATCAA TAGCGACCGT
GTAGCAGAGC GTGTCGTAAA AATGCTGCGC CTGGATGAAA GCCCTGTCGT AAAGGAGCAG
TGGGAGGAGG ACGGAAGCAA GGGTGAGCTT ACCACCTGGC TGGCCAACTT GCTGAAAAGA
AAGCTTGATG TCAAACCTTC CCGGGAAAGC AACGTAATCT ACATTGAGTA TACCGGGAAT
GATCCCAATT TTGCCGCCGC TGTTGCAAAC GCTTTCGCTC AAGCTTATAT CGACGTGAAT
CTGGATTTGA AGGTCGCCCC TGCCCGCCAG TATGCCCACT GGTTCAAAGG ACAAACTGCC
GCTGCCAGGG ATGAGCTAGA GCGAGCCAAA GCTGCACTTT CCGCGTATCA ACAGGAAACC
GGTCTCGTCG CTACCGAGGA ACGTCTGGAT TACGAAGTAG CCAAGCTCAA TGAACTTTCA
AGTCAACTTA CTCTTATCCA AGCCCAGACA TCGGACAGTA GCAGTAAACG GAAAGCCGCT
GAAGATCCGG ACACTTTAGC TGAGGTCATA CAGAGCCCGC TAATAAATAG CCTCAAATCG
GATATTGCGC GCCTCGACGC AAAGCTTCAG GAAAGCAGCG CCTATCTGGG ACCAAATCAT
CCACAAACCA AACGCACTCA ATCTGAACTC GCGTCTCTCA GAGGCAAGCT ATCCGCTGAA
ACACGCAAGA TTCATAGCAG TATCGGCACT TCATATGAAG TGGGGAAGCG AAAGGAGCAA
GAGTTGCTCG AGGCGATGGA AAGGCAGAAG GGGCGCGTGT TGGAGCTTAA CAGACAGCGT
GATCAGATGA GTGTTCTTCA AGGGGATGTG GAAGCTGCGC AACGCAATTT CGAAGGCATA
AGCCAGCGTA GTGCCCTCAC CCGTCTGGAA AGCCTTTCCG TTCAGACTAA CATAACCCCG
TTGAACCCTG CTTCAGTCCC GAGTGAGCCG TCAAGCCCTA AATTGCTGCT CAATACATTG
ATCTCCATCT TTCTAGGTAC GCTGTTGGGT GTGAGCGCCG CTCTGGTTAT GGAATTGATG
AATCGACGCG TGCGTTCTGT AGAGGACATC GTGGAAGCCA TAGAGATTCC GGTGCTGGCG
GTAATGTCCG GCCCGTCTCG CAACAAGCTT TCGAGTCGTC TTCCCAAATT ACCCAACCCG
GAAACCAGTT AA
 
Protein sequence
MTLQQFLLIL RARYKVVLYI LLFTVIATLV VSLLLPKQYT AGTAVVVDIK SPDPVAGMVL 
PGLTSPTYMA TQVDVINSDR VAERVVKMLR LDESPVVKEQ WEEDGSKGEL TTWLANLLKR
KLDVKPSRES NVIYIEYTGN DPNFAAAVAN AFAQAYIDVN LDLKVAPARQ YAHWFKGQTA
AARDELERAK AALSAYQQET GLVATEERLD YEVAKLNELS SQLTLIQAQT SDSSSKRKAA
EDPDTLAEVI QSPLINSLKS DIARLDAKLQ ESSAYLGPNH PQTKRTQSEL ASLRGKLSAE
TRKIHSSIGT SYEVGKRKEQ ELLEAMERQK GRVLELNRQR DQMSVLQGDV EAAQRNFEGI
SQRSALTRLE SLSVQTNITP LNPASVPSEP SSPKLLLNTL ISIFLGTLLG VSAALVMELM
NRRVRSVEDI VEAIEIPVLA VMSGPSRNKL SSRLPKLPNP ETS
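
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so a standard codon table is used here. The helpers `clean`, `translate`, and `gc_content` are hypothetical names introduced for this example, and only the first and last rows of the gene sequence are included.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order
# (shared by translation table 11); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from this record.
FIRST_ROW = "ATGACACTTC AACAATTTTT ACTCATCCTT CGTGCCCGGT ATAAGGTAGT CCTCTATATT"
LAST_ROW = "GAAACCAGTT AA"

print(translate(FIRST_ROW))  # MTLQQFLLILRARYKVVLYI
print(translate(LAST_ROW))   # ETS
```

The first row translates to the first 20 residues of the protein sequence, and the last row translates to its final three residues followed by a TAA stop codon. Passing the complete gene sequence to `gc_content` would give a value to compare against the GC content listed for this gene.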