Gene Nmul_A0414 details


Gene Information

Locus tag: Nmul_A0414
Symbol:
ID: 3784164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 458663
End bp: 460564
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 56%
IMG OID: 637810490
Product: glycosyl transferase family protein
Protein accession: YP_411114
Protein GI: 82701548
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
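
The length fields above are consistent with the coordinates. A minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields.
start_bp = 458663
end_bp = 460564

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1902 633
```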


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCAGA CATACTCTCC CTCAGTCAGC GTAGTAGTTC CCACTTATGA GCAAGCACGA 
TTCATTGGCC GCGCGTTGGA CAGCCTGCAG GCGCAGGTTT TGACGGATTG GGAAGCAGTC
GTCATCGACG ATGGTTCACG GGATGCTACA GCCGAAGTCG TATCGGCATA CCTGGGCGAC
ACCCGCATTC ATTACTATCG CTTTCCCGAA AATCGGGGGT TGGGTCGCAC CCTGAATGAA
GGCATTGCCA AAGCAAAAGC GCCGCTCATC GCTTACTTGC CAACCGATGA TGTTTATTAC
CGCGATCACC TGGGCAGCCT GAAAACATGC CTCGAGACGC AGAAAGGCGC TGTGTTGGCT
TGTTCAGGCG TTCGCCACCA TTACAACCGC GAGGCCATTG GGCAAATTCC GGAATTTCCT
TTGCAACTGG TGCAGTGCAT GCATCGGAAA ATGCCGGTGC GGTGGGTGGA GCGCGCAGAA
TTGGAGTCGG ATGATCTTGA GCGCCTCTAC TGGAGCCTGC TGCGGCCCCT CGGCGCTTTT
GCCGAAAGCG GAACTCTTAC CTGCGAGTGG GTCTCCCACC CGATGCAACG CCACAAGATC
ATGCAGGAAC CGGAAGGCGG CATCAACACG TTTCGCTCCC ATTATCGGGT CAAAGAACCG
CTGCGCTTTC ATACAACCGT AGGCCATCGG ATCGATGAAG CGGACCATTA CCGAAAAATG
CGCGAACGGC CGGATACTCC TCGTGCCGAC AACGGGTTGA AAATACTTCT GGTAGGAGAA
CTGGCCTATA ACGCCGAGCG CGTGCTTGCC CTGGAAGAGC GGGGACACAA GCTGTATGGC
CTGTGGATGC AGAACCCGTA CTGGTACAAC ACGGTAGGGC CGATGCCTTT TGGGCATGTG
GAGGACCTGC CGCGCGACAA CTGGCGCGAA GCAGTGAAGC AGGTGCAGCC TGATGTTATC
TATGCATTGC TCAACTGGCA GGCAGTGCCG TTTGCACATG AGGTATTGAT GGCCACGCAC
GGCATTCCTT TCGTCTGGCA TTTCAAGGAA GGTCCTTTCA TCTGTCTTGA AAAGGGAACC
TGGCCGCAAC TGATCGACCT GCACCGGTAT TCGAACGGCC AAATCTTCTC CAGCCCCGAA
ATGCGGGATT GGTTTGATAC CATCATTCCC GGTTTGTCGC TGGATAAACC CACCCACGTG
CTGGATGGAG ACCTGCCGAA GCGTGACTGG TTTGATCAGC CTCGTGCGCC GCTGCGTTCT
GAAAGCGAAG GGGATATTCA TACCGTATCA CCCGGACGAC CCATCGGGCT GCATCCGCAT
CATGTTGCGG AGTTGGCCCA CCATGGCATT CATCTGCATT TTTATGGGGA GATAACCCAT
GGACAATGGC TGCAATGGAT TGAAAAAACC CAAGCAATCG CGCCAGACTA CCTGCATTTA
CACCCGAACG TGGATCAAAG TCACTGGACT GCCGAGTTCT CCCAATATGA CGCGGGCTGG
CTGCATCTCT TCGAAAGCAG CAACAGAGGG GAAATCAGGC GGGCAAACTG GGATGACCTG
AACTACCCGG CAAGGCTGAG TACCCTTGCA GCGGCCGGCC TGCCGATGCT GCAGAAGGCG
AATGAGGATG CTCTCGTCGC CACTCGTACA TTAGCAAAGC AGCTTGATAT TGGAATTTTT
TTTGATACCG TGGATGAGCT CGCCGCACAG TTGCGCGACT GCAAGCAGTT GCGAGCGACG
CGTGAGCGGG TCTGGCAGCA GCGGCACCTG TTCACATTCG ATCATCATGT CCCAGAACTG
GTTGATTTCT TCCGCCTCGT AATAGCGTCC ACTTCCAGGA AACAGACACG CACCGGAACT
ACAGTAGAAT CTGCTCATCC GCAAAGCCGC AGGCTGGCAT GA
 
Protein sequence
MPQTYSPSVS VVVPTYEQAR FIGRALDSLQ AQVLTDWEAV VIDDGSRDAT AEVVSAYLGD 
TRIHYYRFPE NRGLGRTLNE GIAKAKAPLI AYLPTDDVYY RDHLGSLKTC LETQKGAVLA
CSGVRHHYNR EAIGQIPEFP LQLVQCMHRK MPVRWVERAE LESDDLERLY WSLLRPLGAF
AESGTLTCEW VSHPMQRHKI MQEPEGGINT FRSHYRVKEP LRFHTTVGHR IDEADHYRKM
RERPDTPRAD NGLKILLVGE LAYNAERVLA LEERGHKLYG LWMQNPYWYN TVGPMPFGHV
EDLPRDNWRE AVKQVQPDVI YALLNWQAVP FAHEVLMATH GIPFVWHFKE GPFICLEKGT
WPQLIDLHRY SNGQIFSSPE MRDWFDTIIP GLSLDKPTHV LDGDLPKRDW FDQPRAPLRS
ESEGDIHTVS PGRPIGLHPH HVAELAHHGI HLHFYGEITH GQWLQWIEKT QAIAPDYLHL
HPNVDQSHWT AEFSQYDAGW LHLFESSNRG EIRRANWDDL NYPARLSTLA AAGLPMLQKA
NEDALVATRT LAKQLDIGIF FDTVDELAAQ LRDCKQLRAT RERVWQQRHL FTFDHHVPEL
VDFFRLVIAS TSRKQTRTGT TVESAHPQSR RLA
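
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the codon table from the amino-acid string that NCBI lists for translation table 11 (the same codon assignments as the standard code) and translates just the first three 10-base blocks and the final 12 bases of the gene sequence. The helper name `translate` is hypothetical.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last 12 bases, copied from the gene sequence.
first_blocks = "ATGCCGCAGA CATACTCTCC CTCAGTCAGC"
last_codons = "AGGCTGGCAT GA"

print(translate(first_blocks))  # MPQTYSPSVS
print(translate(last_codons))   # RLA*
```

Both results agree with the start (MPQTYSPSVS) and end (RLA) of the protein sequence listed above; the trailing `*` is the TGA stop codon, which is not part of the 633 aa protein.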