Gene Acry_2216 details


Gene Information

Locus tag: Acry_2216
Symbol:
ID: 5160148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 2449820
End bp: 2450857
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 62%
IMG OID: 640554138
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_001235333
Protein GI: 148261206
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
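
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2449820
END_BP = 2450857


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # prints: 1038 345
```

Both results agree with the listed gene length (1038 bp) and protein length (345 aa).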


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000045016
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGATTT TCAGGAGCGC TTCGGCGCTT GGCGCGATGG CGGCCGTCTT GAGCTTCGTG 
GGAGTCGCGC GGGCGGCGCC GGCCCTGGAT ACGGCAAAAA TGACCCAGAT GCAGTCGAAG
GCGGTGTGCG TGAATCCGCA CCCTACGCAT GTCAATTTGT CGAAGCTCGT AGTCGGATTC
AGCCAGTCTG AATCGAATGC CAATCCGTTC CGAGCCGGTG AGACCAAGTC GGTCCGGGAT
GCGGCGAAAG CGTTTCATGT CCGCCGGCTG ATCTATACAA ACGCGCATAG CAACCAGTCG
CGCCAGGTCG CCGATGTCGA GAACATGATC AACCAAGGCG CGCAGGCGCT GATCATCGCG
CCGCTGGATT CGACCGGCTT GCAACCGGCT TTCGCGCAGG CCGCAGCCAA GCACATTCCC
ATCCTCACCC TCGACCGGCG GACCGCGGGC TCGAAGTGCA GCGATTATCT GAGCTTCCTG
GGCTCCAACT TCTATTTCAA GCAAGGCGAG ATCGACGCGC GAGAACTAGC GAAGGCGACC
GGCGGCCACG CGATGGTGGC GGAGATTCAG GGCGCCTACG GCAATTCGGT GGAGGTGGCG
CGCACCAAGG GCTTCGCCGC TGGGCTCAAA GCCTATCCTG GCATGAAGAT CGTCACCGAG
CAGACCGGTA ACTGGTTCAC CACCGACGCG CAGAAGGTGA TGAGCCAGAT TCTGCTCGCG
CATCCGAATG TGAATGCGGT CTATGCCCAA GCGGATACGA TGGCGTTCGG CGCGATCACC
GCGCTGCGCG ACGCCGGCAA GAAGCCGGGA CAGGTCAAGA TCGTGTCGAT CGACGGCACC
CGGCAGGGGG TTCAGGACAT CGTTGACGGC TGGATCTATG CCGATGACGA AACCAATCCG
CGCTTCGGGC CGATCGCGTT TCACGAGCTG CAGAACTGGT TCGACGGTAA GCCGGTGCCG
CGGCACATCG TGCTGACGGA TCATATCTAC ACCCCGGCGA ATGCAGCGGC GGCGCTGAAG
AACAACGTGC CGTTCTAA
 
Protein sequence
MMIFRSASAL GAMAAVLSFV GVARAAPALD TAKMTQMQSK AVCVNPHPTH VNLSKLVVGF 
SQSESNANPF RAGETKSVRD AAKAFHVRRL IYTNAHSNQS RQVADVENMI NQGAQALIIA
PLDSTGLQPA FAQAAAKHIP ILTLDRRTAG SKCSDYLSFL GSNFYFKQGE IDARELAKAT
GGHAMVAEIQ GAYGNSVEVA RTKGFAAGLK AYPGMKIVTE QTGNWFTTDA QKVMSQILLA
HPNVNAVYAQ ADTMAFGAIT ALRDAGKKPG QVKIVSIDGT RQGVQDIVDG WIYADDETNP
RFGPIAFHEL QNWFDGKPVP RHIVLTDHIY TPANAAAALK NNVPF
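
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows, assuming the sequence is pasted as space-separated 10-base blocks as shown in this record. The codon table is written out by hand in NCBI's TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts, which this sketch does not model. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block_text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(block_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGATGATTT TCAGGAGCGC TTCGGCGCTT GGCGCGATGG CGGCCGTCTT GAGCTTCGTG"
print(translate(first_line))  # prints: MMIFRSASALGAMAAVLSFV
```

The output matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to `translate` in the same way gives the complete product for comparison.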