Gene Ajs_4012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_4012 
Symbol 
ID4671795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp4271170 
End bp4272879 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content72% 
IMG OID639841052 
Productprotein of unknown function DUF894, DitE 
Protein accessionYP_988192 
Protein GI121596296 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCC AGCCCCTCCA TGACAGCGGA CCGCTGAGTT CACAGCCCAC GCCGCCAGGC 
CCCGACGCGG CCTCCGCCGC CGCCCACACC GCCGAGGCCG CAGCCACAGC CGAGTTGCAG
CACAGCGCGG CGCAGCGTGC GCCGCAGTCG CCCTTCGCGC CGCTGTCGGT GCCGGTGTTC
CGCATGCTGT GGCTCACCTG GCTGGCGGCC AACACCTGCA TGTGGATGAA CGACGTGGCC
ACGGCCTGGC TGATGACCAC GCTGACCGAT TCGCCTGCCC TGGTGGCGCT GGTGCAGACG
GCCTCGACGC TGCCCGTGTT CCTGCTGGGC TTGCCCAGCG GCGCGCTGGC CGACATCCTG
GACCGGCGGC GCTATTTCAT GGTCACGCAG TTTTGGGTGG CGGCCGTGGC GGTGGTGCTG
TGCGTGGCCA TCCTCTGGGG TGGGCTCAAC CCCTACCTGC TGCTGGCGCT GACGTTCGCC
AACGGCATCG GGCTGGCGAT GCGCTGGCCG GTGTTTGCGG CCATCGTGCC GGAGCTGGTG
AACCGCCAAC AGTTGCCCGC GGCGCTGGCG CTCAATGGCG TGGCCATGAA CGCCTCGCGC
ATCATTGGCC CGCTGGTGGC CGGCGCCATC ATCGCCAGCG CGGGCAGCGC CTGGGTGTTC
GTGCTGAATG CCGTGCTGTC GCTGGTGGCG GGGTTCACCA TCATGCGCTG GCGGCGCCAG
CCCATGCCCA ACCCGCTGGG GCGCGAGCGC CTGACCAGTG CCATGCGCGT GGGTCTGCAG
TTCGTGCGCG AGTCGCCCCC GATGCGCGCC GTGCTGTGGC GCATCTCGAT CTTCTTCCTG
CATGCCACGG CGCTGCTGGC GCTGCTGCCG TTGGTGGCGC GCGACCTGCA GGGCGGCGGC
GCGGGCACCT TCACGCTGCT GCTGGCCTCC ATGGGCGCGG GCGCCGTGAG CGCTGCCATG
TTCCTGCCGC GCCTGCGCCA GATGATGTCG CTGGACCAGC TGGTGGCGCG CGGCACGCTG
CTGCAGGCGC TGGCCACGGC CGTGGTCGCC ATTGCGCCCA ACGTGTACGT GGCGGTGCCG
GCCATGCTGG TCGGCGGGGC GGCGTGGATC ACCACCGCCA ATTCGCTCAC GGTGGCCGCA
CAGCTGGCGC TGCCCAACTG GGTGCGCGCG CGCGGCATGT CCATCTACCA GATGTCCATC
ATGGGCGCCA CGGCCGTGGG CGCGGCGCTG TGGGGGCAGG TCGCCGCACT CTCCAGCGTG
CACATGAGCC TGGCATTGGC GGCACTCACC GGGGTGCTGG TGATGGCGCT GGTACAGCGC
CTGGTGAGCA ACCGCCATGG CGAGGAGGAC CTGAGCGCCT CGCGCGCCTT CCAGGCACCG
CGGGCCGACA GCCCGCCCGC CGCCGGCCTG CGGCTGGTGG TCAGCATCGA ATACTTCATC
AACCCCGCAC GGGCGGCGGA ATTTCGCGCC GTGATGCAGG AAAGCCGCCG CGCGCGCCTG
CGCCAGGGCG CCTTGAGCTG GGAGCTGCAG CACGACATCG CCGACCCGCG CCGCTACGTG
GAGCGCGTGG TGGACGAATC CTGGACGGAG CACCTGCGGC GCTTTGACCG CGTCACCGCC
TCCGACGTGG CGCTGCGCGA CAGGCGCTTT GCCTTCCACG TGGGCGACGC GCCGCCCGTG
GTGTCGCGCT ACGTGGTCGA GGGCGAATGA
 
Protein sequence
MPPQPLHDSG PLSSQPTPPG PDAASAAAHT AEAAATAELQ HSAAQRAPQS PFAPLSVPVF 
RMLWLTWLAA NTCMWMNDVA TAWLMTTLTD SPALVALVQT ASTLPVFLLG LPSGALADIL
DRRRYFMVTQ FWVAAVAVVL CVAILWGGLN PYLLLALTFA NGIGLAMRWP VFAAIVPELV
NRQQLPAALA LNGVAMNASR IIGPLVAGAI IASAGSAWVF VLNAVLSLVA GFTIMRWRRQ
PMPNPLGRER LTSAMRVGLQ FVRESPPMRA VLWRISIFFL HATALLALLP LVARDLQGGG
AGTFTLLLAS MGAGAVSAAM FLPRLRQMMS LDQLVARGTL LQALATAVVA IAPNVYVAVP
AMLVGGAAWI TTANSLTVAA QLALPNWVRA RGMSIYQMSI MGATAVGAAL WGQVAALSSV
HMSLALAALT GVLVMALVQR LVSNRHGEED LSASRAFQAP RADSPPAAGL RLVVSIEYFI
NPARAAEFRA VMQESRRARL RQGALSWELQ HDIADPRRYV ERVVDESWTE HLRRFDRVTA
SDVALRDRRF AFHVGDAPPV VSRYVVEGE