Gene Acry_3003 details

Gene Information

Locus tag: Acry_3003
Symbol:
ID: 5160915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 3284978
End bp: 3286306
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 67%
IMG OID: 640554933
Product: extracellular solute-binding protein
Protein accession: YP_001236112
Protein GI: 148261985
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
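
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3284978
END_BP = 3286306
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the listed values: the interval spans 1329 bp, and 1329 / 3 gives 443 codons, one of which is the stop codon.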


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGCC GCACCCTGAT CGCCTCGGCC CCGGCCACGG CGCTGGTCGC CGCAACCGCG 
CGCGCCGCAG CGCCCACCCG GGTCGTGCTC TGGCACGCCA TGGGTGGCGC GCTCGGCGAC
AAGCTGCAGT CGATCGTCGA CGCCTTCAAC AAGAGCCAGG CGGATTTCGC CGTCGACGCA
GTCTACAAGG GCTCCTACCC GCAGGTGCTG ACGGCCACCA TCGCCGCCTG GCGCGCCGGC
AAGGCGCCCT CGATCGCCCA GGTGTTCGAT GTCGGCACCG CCGACATGCT CGGCGCCGGC
CGCGCGGTGG TCGATGTCTA CAAGCTCGCG GAGATGACGG GCGTGCCGAT CGAGGCATCC
ACCTATATCC CTGCGGTGCG CGGCTACTAT AGCCTCAACG ATGGCAAGAT GGGCGCTGCA
CCGTTCAACT CCTCGACCGC GCTGATGTGG ATCAACGAGG ACGCCTTCGA GAAGGCCGGG
CTCGACCCCA AGGCCCCGCT CGCGACCTGG GACGACGTGA TCAAGGCCGC CCGCGCGGTC
AAGGCGAAGG GTGCCGCTGA AATCCCGGTG ATGACCTCAT GGCCGACCTG GGTGCATTTC
GAGCAGTTCG CCGCCATCCA CAATGTCGAA TACGCCACGC TGAATGACGG TTTCGGCGGC
CCGAACCCGA AACTGCGGCT CGACTCGCAT CCCTTCGTGC GCAATCTCGA CACGCTGCTG
ACGATGCAGA AGGAAGGCCT GTTCCACTAC GAGGGCCGCG ACGGCAAGCC GAGCCCGATC
TTCTACGCCG GCAAGGCGGC GATCACCTTC GACAGCTCCT CGATCTACGG CCAGCTGGTG
AAGAGCGCGA AGTTCCGCTT CGCCAACGCC TACCTCCCCT ACCATCCCTC GATCATCAAG
AGCCCGATCA ACTCGATCAT CGGCGGCGCC GCCTTCTGGG CGATGACCGC GCCGGGCCGT
GGCAAGGCGG AATACGAGGC CGTGGCGCGC TTCTTCAAGT TCATCTCCGA ACCGCAGAAC
GATGCCGGCT GGGCCGAGGC CACCGGCTAT GTGCCGGTAA CGCTGGCGGG GAACGACTAC
ATCGCCAAAC AGGGCTTTTA CGCAAAGCAG CCGGGCGGCG ACCTCGCGGT CAAGCAGCTC
ACCCGCACGG AGCCGACGAA GTATTCGCGC GGCATCCGCC TGGGCGGCAT GCCGGAAGTC
CGCGTGATCA TCGAGGAAGA GTGGGAAAGC GCGATCCAGA ACGGCACGCC GGCCCACAAG
GCGCTGGCGC GCGCCCAGCA GCGCAGCCAG GCGGTGGTCG ACCGCTTCGC CCGCGCCCTG
CACGGCTGA
 
Protein sequence
MKRRTLIASA PATALVAATA RAAAPTRVVL WHAMGGALGD KLQSIVDAFN KSQADFAVDA 
VYKGSYPQVL TATIAAWRAG KAPSIAQVFD VGTADMLGAG RAVVDVYKLA EMTGVPIEAS
TYIPAVRGYY SLNDGKMGAA PFNSSTALMW INEDAFEKAG LDPKAPLATW DDVIKAARAV
KAKGAAEIPV MTSWPTWVHF EQFAAIHNVE YATLNDGFGG PNPKLRLDSH PFVRNLDTLL
TMQKEGLFHY EGRDGKPSPI FYAGKAAITF DSSSIYGQLV KSAKFRFANA YLPYHPSIIK
SPINSIIGGA AFWAMTAPGR GKAEYEAVAR FFKFISEPQN DAGWAEATGY VPVTLAGNDY
IAKQGFYAKQ PGGDLAVKQL TRTEPTKYSR GIRLGGMPEV RVIIEEEWES AIQNGTPAHK
ALARAQQRSQ AVVDRFARAL HG
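
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing could be cleaned, translated, and checked for GC content. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The helper names `clean_sequence`, `translate`, and `gc_percent` are hypothetical and written only for this illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of each base position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence from this record, used as sample input.
FIRST_LINE = (
    "ATGAAACGCC GCACCCTGAT CGCCTCGGCC CCGGCCACGG CGCTGGTCGC CGCAACCGCG"
)


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a CDS codon by codon, dropping one trailing stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    print(translate(FIRST_LINE))
```

Run on the first line of the gene sequence, this yields the first 20 residues of the listed protein (MKRRTLIASA PATALVAATA). Pasting the full gene sequence into `translate` and `gc_percent` would be one way to compare against the Protein Length and GC content fields.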