Gene Ajs_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_2022 
Symbol 
ID4672787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp2128329 
End bp2129690 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content72% 
IMG OID639839100 
Product3-deoxy-D-manno-octulosonic-acid transferase domain-containing protein 
Protein accessionYP_986274 
Protein GI121594378 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1519] 3-deoxy-D-manno-octulosonic-acid transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCAGC TCATCCTGTG GCTCTACAGC CTGGCCGTGT GGCTGGCCAC GCCGCTGCTG 
CTGCGCAAGC TGCGCCGCCG CGCGCTCACC GAGCCCGGCT ACGCCGTGGC CGTGCCCGAG
CGCTTCGGCC ACTACCCGCC GCCTATGGAC AGCCTGTCGC CGTCATCGGA GACCGAAGCC
GACGAGCAGT TCATCTGGAT CCACGCCGTC TCCCTGGGGG AGACACGCGC GGCGGCCATC
CTGCTGAAGG AACTGCGCCC GCTGCTGCCC GGCATGCGCC TGCTGCTCAC CCACGGCACG
GCCACGGGCC GGGCCGAGGG CGAAAAGCTG CTGCTGCCCG GCGACGTGCA GGTGTGGCAG
CCCTGGGACA CGCCCTGGGC CGTGCGGCGC TTTCTGCGGC AGTTCCGCCC GTCCATCGGC
ATCCTCATGG AGACGGAGAT CTGGCCCAAC CTCGTCGCGG CCTGCCGCCG GCGGCGCATC
CCGCTGGTGC TGGCCAACGC GCGGCTCAAC GAAAAGTCGC GCGCCGGCGC GCGCCGGCTG
GGCTGGCTGT CGCGCCCGGC CTACGCCGGC CTGTCGGCCG TGTGGGCGCA GACCGAGGAT
GACGCCTCCC GGCTGCGCGA CGTGGGCGCG CAGGTGGCGG GAGTCTTCGG CAACCTGAAG
TTCGACGTGG TGCCGTCACC CACCCTGCAG GCGCAGGGCC GCACGTGGCG GGCCGCCAGC
GCCCGACCGG TGGTGCTGCT GGCCAGCAGC CGCGAGGGCG AAGAGGCCAT GTGGCTGGAG
GTTTTGAAGC AAAAAACGCC TATAACGCCC GCCAATCAAG CGCCAGCAGC TATTGATTCA
GGAGTAAATC AATCCGTGCA GTGGTTGGTG GTCCCGCGCC ACCCCCAGCG TTTTGACGAG
GTGCAGCGCC TGTGCGAGGC CGCCGGCCTG CGCGTGTCGC GGCGCAGCCA GTGGACCGCG
CAGCCGGACA GCGCCGATGT GTGGCTGGGC GACTCGCTGG GCGAGATGGC GCTGTATTAC
GGCCTGGCCC ACGTGGCGCT GCTGGGAGGC AGCTTTGCGC CGCTGGGCGG GCAGAACCTC
ATCGAGGCCG CGGCAGGCGG CTGCCCCGTG GTCATGGGCC CGCACACCTT CAACTTTGCC
GAGGCCGCGC GCCTGGCCAT CGACGCTGGC GCCGCCCTGC GCGTGGCCGA CATGGCCGAG
GGCGTGGCCG CCGCGACCGC CCTGGCGCAG GACCCGCAGC GGCGGAGGGC GCTGTCCGAG
CGCTGCGTGG CCTTCACCGA GGAGCACCGC GGCGCGGCCC TGGACACCGC CCTGGCCGTC
CTGCAGCGCC TGCGCGAGGC CGTGGACGAC CGTTCCGACT GA
 
Protein sequence
MHQLILWLYS LAVWLATPLL LRKLRRRALT EPGYAVAVPE RFGHYPPPMD SLSPSSETEA 
DEQFIWIHAV SLGETRAAAI LLKELRPLLP GMRLLLTHGT ATGRAEGEKL LLPGDVQVWQ
PWDTPWAVRR FLRQFRPSIG ILMETEIWPN LVAACRRRRI PLVLANARLN EKSRAGARRL
GWLSRPAYAG LSAVWAQTED DASRLRDVGA QVAGVFGNLK FDVVPSPTLQ AQGRTWRAAS
ARPVVLLASS REGEEAMWLE VLKQKTPITP ANQAPAAIDS GVNQSVQWLV VPRHPQRFDE
VQRLCEAAGL RVSRRSQWTA QPDSADVWLG DSLGEMALYY GLAHVALLGG SFAPLGGQNL
IEAAAGGCPV VMGPHTFNFA EAARLAIDAG AALRVADMAE GVAAATALAQ DPQRRRALSE
RCVAFTEEHR GAALDTALAV LQRLREAVDD RSD