Gene Nmul_A2152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2152 
Symbol 
ID3784392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2445904 
End bp2447067 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content53% 
IMG OID637812240 
Productglycosyl transferase, group 1 
Protein accessionYP_412837 
Protein GI82703271 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAATCA TTTCAACCGC CGAACGGCCT GTCAATCTTC TTGAGATCAT AGGGGCTTCC 
ATTGTCGGTG GAATGGAAAC TTACGTCTTG AGGTTGCTCG AGCGGCTTCC CCAGGACTCT
TTTCGGGTCA CCTGCCTCTG TGTGGCGGAA GGCAAACTCG CATCCCAATT ACGCAGCATC
GGTTGCAGCG TTCACATCAC GCCCATCACG GATGAGCCCG ACTGGCAGGC TGTCCTGCTG
GGCTCCAGCC TGATCCGGAC TGATGCGATC GACGTCATTC ATGCACACCT TCCCAATGCA
CACTCACTTG CAGGTATTCT GAGCAGGTTG ACCGACACGC CAGCTGTGTC CACTATTCAC
GGACGCTACC TGAGCATGCG CGATTTTGAA GTGCACAAGC TCATGAATAC CCATATCAAC
GTGGTTGCAA AAACCGCCTA CTTCCATGCT TTGACCCTCG GCGTACCGAG CACAAAACTA
CGTTTTATTC CGAATGGAAT CGATACAAAG ATCTTTCAAC CGGCCCCCAA GTCGAATTAC
CTTCATTCCT TGATCAAGAT CCCGCCAGAA ACGCCTCTCG TAGGCTTCAT TGGACGCCTC
TCGCCCGAAA AGGGACCGGG AGTATTTGTT CAGGTGGCGC GGATAGCACA AAGGAAATTG
AAGAATTGCC ACTTCGTACT GGTGGGCGAA GGGCCGATGC GGCGGGAGCT CCAGAAGGAA
ATCGACGAGT ATGGCCTGAA GGACCACATC CATATAGTCG GACTGCAGAG AGACATCACA
AAAATCTATC CCTGCCTCGA TCTCGTCGTG TCGACTTCAT ATTCCGAAGC GATGCCACTT
GTAATAGTAG AAGCGATGGC GTCAGGCCTC CCGGTGGTGG CCACGAATGT CGGGGGGGTG
GTCGATATTG TCGAGGTAGG CGGAACCGGA TTGCTGAAAG GTCCGGGAGA TACGGAAGGA
CTGGCAAACG ATGTCATCAC TTTGATGACC GACAATTCGA CCCGTATCCA GATGGGAGCA
GCAGCCCGAA AACGGGCCGA AGAAAAATTC GACCTAAGTG ATATCGTCGC TCAAACTGCA
CAATTATTGC GATCTCTTAC CCAAGGGGGT ATCAAAGGAA GCGATGCGGA TACGACAAAA
GGCTATCGAA AAGCGCGTTC CTGA
 
Protein sequence
MKIISTAERP VNLLEIIGAS IVGGMETYVL RLLERLPQDS FRVTCLCVAE GKLASQLRSI 
GCSVHITPIT DEPDWQAVLL GSSLIRTDAI DVIHAHLPNA HSLAGILSRL TDTPAVSTIH
GRYLSMRDFE VHKLMNTHIN VVAKTAYFHA LTLGVPSTKL RFIPNGIDTK IFQPAPKSNY
LHSLIKIPPE TPLVGFIGRL SPEKGPGVFV QVARIAQRKL KNCHFVLVGE GPMRRELQKE
IDEYGLKDHI HIVGLQRDIT KIYPCLDLVV STSYSEAMPL VIVEAMASGL PVVATNVGGV
VDIVEVGGTG LLKGPGDTEG LANDVITLMT DNSTRIQMGA AARKRAEEKF DLSDIVAQTA
QLLRSLTQGG IKGSDADTTK GYRKARS