Gene Ajs_2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_2472 
Symbol 
ID4672234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp2631584 
End bp2632741 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID639839540 
Producttetratricopeptide repeat protein 
Protein accessionYP_986709 
Protein GI121594813 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.251188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.3368 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTTG ACCTGAGCTG GATCTTCCTG GGGCTGCCGC TGGCCTTCGG GCTGGGGTGG 
TTCGCCTCGC GTTTCGACCT GCGGCAGATG CGCGAGGAGA ACCGCCGCGC ACCCAAGGCT
TACTTCAAGG GCCTGAACTA CCTTCTCAAC GAGCAGCAAG ACCAGGCCAT CGATGCCTTC
ATCGAGGCGG TGCAGAACGA CCCTGACACC ACCGAACTGC ACTTCGCCCT GGGCAACCTG
TTTCGCCGCC GCGGCGAGTA CAACCGCGCC GTGCGGGTGC ACGAGCACCT GCTGTCGCGC
GGCGACCTGA GCCGCGCCGA CCGCGAGCGC GCACAGCATG CGCTGGCGCT GGACTTCCTC
AAGGCAGGCC TGCTCGACCG CGCGGAAGAC GCACTGCGCC GCCTGGAGGG CAGCGCCTTC
GAGGGCCAGG CACGCATGGC CCTGCTGGCC ATTTACGAGC GTTCGCGCGA CTGGCCACAG
GCGTCGGACA TTGCACGGCG CATGCACCAT GCGCAGCAGG GCGACTTCAG CACCCGGCTG
GCGCACTACC TGTGCGAGCA GGCGCTGGCG CTGGCAGCCC ATGGCGAACT GCCCGCCGCC
CAGGCGCTGC TGGAGCAGGC CCTGGCCACG GCGCCCCAGG CGCCGCGCGC GCGCATCGAG
CTGGCGCGGC TGCAGCAGCG CCAGGGCCAG CCCGAAGCGG CCTTCGACAC CCTGCAAGCG
CTCGCCCAGG CCGCACCCGC CGCGCTGCCG CTGGCCGCAC CGCTGCTGGT GGAGACCGCC
ACCGCCACGG GACAAGCGCC GCAGGCCCAG GCGCTGCTGC AGCACCACTA CGAGGACATG
CCATCCCTGG ATCTGCTGGA AGCCGTGGTG GCGCTGGAGG CTGCCAACGC GAACACTGCG
GCCGTTGGGC GCGAGTGGTA CGTGCGCCAC CTGGAGCGCG AGCCCTCCCT GGTCGCCGCG
ACGAAGTGGC TGGCAGGCGA GACGCTGACC CATGAGCAGT TCCACCCGCA GATCCAGCGC
GCGCTGGAGC AGGCGGCAAA GCCGCTCACG CGCTACCGCT GCGCAGCCTG CGGGTTCGAG
GCACGCCAGC ACTTCTGGCA ATGCCCGGGC TGCCAGACCT GGGACAGTTA TCCGGCACGG
CGCGTCGAGG AGCTGTAG
 
Protein sequence
MEFDLSWIFL GLPLAFGLGW FASRFDLRQM REENRRAPKA YFKGLNYLLN EQQDQAIDAF 
IEAVQNDPDT TELHFALGNL FRRRGEYNRA VRVHEHLLSR GDLSRADRER AQHALALDFL
KAGLLDRAED ALRRLEGSAF EGQARMALLA IYERSRDWPQ ASDIARRMHH AQQGDFSTRL
AHYLCEQALA LAAHGELPAA QALLEQALAT APQAPRARIE LARLQQRQGQ PEAAFDTLQA
LAQAAPAALP LAAPLLVETA TATGQAPQAQ ALLQHHYEDM PSLDLLEAVV ALEAANANTA
AVGREWYVRH LEREPSLVAA TKWLAGETLT HEQFHPQIQR ALEQAAKPLT RYRCAACGFE
ARQHFWQCPG CQTWDSYPAR RVEEL