Gene Nmul_A0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0421 
Symbol 
ID3784171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp467209 
End bp469119 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content56% 
IMG OID637810497 
Productglycosyl transferase family protein 
Protein accessionYP_411121 
Protein GI82701555 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGAGC TGGCTGGAGT TGGGCGACGT CCCCTATTTC CGGATCACAG TCGTTTTGCA 
GTCCCAGGGT TATTGGCCGC GTTCCTCGAC GGACTGCTGT TCGAGTCGCT GATGAGCTAT
GGGGCAAACC TCGCCCTGTC GCATATCGTG GGCTTCCTCG CGGCTGCGAC CTTGCACTAC
TGGTTTATTT CGAGACCGCG GCTTTTATAC GACCTCGCGG GCTATTCCAG ATGGACTGAG
CTCGGCCGCT TCCTGGTTAT CTGTGTGCTG GCGCTTTCCG TGCGCGGCGG CCTATTAGCG
GTGCTAATCC AGGCCTGGGA TATACCGCCG GCTGCGGCCA TACTCCCTGC AATCGCGGCC
ACAGCCGTAA TGTTGTATCT GGGATCTGTT TTCTACGTGA TCCCCAGCGG GAACGCCCTA
CCCTCGCCTG AGGCAGACTG GCGCATCGCC TCTTTCGGGA TCATCGCATT TGTCATCCTG
TTGCGCCTCA TCTACATTGG CGTGGCGCAA CTGATTCCTG ATGAAGCATA TTACTGGAAT
TACGCCCAGC ACATGGATCT GAGTTTCTAC GATCACCCAC CTATGGTTGC CTGGTTGATA
TGGCTTGGCA CATCCATTTT TGGAAACAGC GAATTTGGCG TCCGGGTCGG AGCATTCCTG
TGCGGCTTGA TAACAATGGG GTATTTATTT GCGCTCACAC GCAATCTATA CGATAGCGCA
ACGGGGATTC GCGCTGTGCT GCTGCTTGCG ATTCTTCCCT TTTTCTTTGC TACTGGCGCA
TTGATGACGA CAGATGCGCC GCTCGTGGCC GCGTGGGCTG CAACTCTGTA CTATCTCGAG
CAGGCCCTGA TCGCCCAGCG GGAGCGGGCT TGGCTCGGGG TGGGCATTGC CTTCGGTCTG
GGCATTCTCT CCAAGTATAC CCTGGGTTTG GTGGGTATTG CCGCACTGGT TTTTGTGATT
ATCGACCCGG TTGCACGCCG CTGGTTGCGC CATCCTTATC CGTACCTCGC TGTACTGCTT
GCACTGCTCT TGTTCTCCCC CGTAATTATC TGGAACATGG AGCATGGCTG GATGTCGATA
TTGTTCCAGT CCGGCCGCGT CAGGGGCGTA GGAGATGACG AGTTCGGATT ATTCGAATTG
ATTCTTCATT TTGCCGTACT GCTTACGCCG ACCGGCTTGC TCGCTGCCGC GCTAGTATTG
CTGCCCGGCA CCCGGCAGAA TAATAGCGTC TCCATTTGCA GGCGCCGCCT GTTCATTATG
GTATTCACCG GCGTGCCGCT GGCGATCTTC CTGATTCTCA GCATTTTCGA CACCCAGCGG
TTCCACTGGA CCGGGCCGCT ATGGCTTATG GCACTGCCTG CCATGGCTTC GATGATGGGG
AAAATGCGAA ATTTCTCTGG TAACCCGACG ATTGCGGATC GGGTAATGGC AGCTTGGAGG
CCCACTATTC TGGCATGCCT GGTTTGTTAT GCGATTGCAC TGCACTACCT CGTTCTCGGC
TTCCCAGGCA TTCCCTACCA GGGCATATAT AAAGGTTTCC CCGAGCATTA TTTCTGGCGC
CAGGCAACGA TGGGCATCGA GCAGATTGTA GAGGATACGC GGCGGCAAAC CGGCCAGGAG
CCCATCGTCG TGGGCATGAG CAAGTGGTCC GTGGCCAGCA TCGTCACTTT CTATAATCGG
GGCAAACCCA TGGAAATACG TTCGCGCAAC ATATTCGGCG ACAGCGGCGC CATGTACGAT
CTCTGGTATC CGTCGGAATC CCCTACGACG CGACCCGTCA TTCTCGTCGG CATGTGGCAG
GATAATCTCG AATGCGTCCG AGACAGCATC GAGATCGGGA AGATGCTCGC CAATCCTGAT
CCGGTTCGGC AACTAGGTTC AGGACCTATT AATTTCATGA TTCACGAATA G
 
Protein sequence
MLELAGVGRR PLFPDHSRFA VPGLLAAFLD GLLFESLMSY GANLALSHIV GFLAAATLHY 
WFISRPRLLY DLAGYSRWTE LGRFLVICVL ALSVRGGLLA VLIQAWDIPP AAAILPAIAA
TAVMLYLGSV FYVIPSGNAL PSPEADWRIA SFGIIAFVIL LRLIYIGVAQ LIPDEAYYWN
YAQHMDLSFY DHPPMVAWLI WLGTSIFGNS EFGVRVGAFL CGLITMGYLF ALTRNLYDSA
TGIRAVLLLA ILPFFFATGA LMTTDAPLVA AWAATLYYLE QALIAQRERA WLGVGIAFGL
GILSKYTLGL VGIAALVFVI IDPVARRWLR HPYPYLAVLL ALLLFSPVII WNMEHGWMSI
LFQSGRVRGV GDDEFGLFEL ILHFAVLLTP TGLLAAALVL LPGTRQNNSV SICRRRLFIM
VFTGVPLAIF LILSIFDTQR FHWTGPLWLM ALPAMASMMG KMRNFSGNPT IADRVMAAWR
PTILACLVCY AIALHYLVLG FPGIPYQGIY KGFPEHYFWR QATMGIEQIV EDTRRQTGQE
PIVVGMSKWS VASIVTFYNR GKPMEIRSRN IFGDSGAMYD LWYPSESPTT RPVILVGMWQ
DNLECVRDSI EIGKMLANPD PVRQLGSGPI NFMIHE