Gene Apre_1618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1618 
Symbol 
ID8398430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1764321 
End bp1765271 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content40% 
IMG OID644995982 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003153360 
Protein GI257067104 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGA CAAGAAGACT ATTATCAGTT TTGTTGCTTG CGATTTTCAT GATTACAGGA 
TGTAGCATTG AAGGTCAAAA CAACAAGGAA GAAAGCAAAG AAAGTGGAGT AGAAGAGAAA
GCGAGTGAGA CAAAAACTGA TGGAGATATG AAAATAGGAG TTTCCCTATC AACCCTAAAC
AACCCATTTT TCGTATCAAT CAGAGAAGGA GTTGAGGAAG CCGCAGGTAA AGAGAACGTA
GAGACAGTAA TTACAGATGC TCAAAACGAC TCATCTACCC AAAACAACCA AGTCGAAGAC
CTCATCACTC AAGGAGTTGA CTTAATAGTT ATTAACCCAG TAGATTCAAC AGCCATAGCT
ACATCAGTTG AGAAGGCAAA TGAAGCAAAC ATCCCAGTAA TCTGTGTCGA CAGAGGATCA
GACCAAGGTG AACTTGTAAG CTTCATAGCA AGTAACAACG TAGAAGGTGG AAAGCTTGCT
GGTGAATATA TACTAGAAAA AGTAGGAGAA AATGCCGAAG TAATCCAACT TGAAGGAATC
CCAGGAGCAA GCTCTACTCG TGAAAGAGGA GAAGGATTCG AAGAAGCTAC AAATGGTAAA
ATCAACCTAC TAGCTAGCCA AACAGCAAAC TTTGATAGGG CAGAAGGAAT GACAGTAATG
GAAAATCTTC TCCAAGCTCA CCCAGATGTA AAGGCAGTAT TCTGCCAAAA CGATGAAATG
GCACTTGGAG CAAGTGAAGC CATAAAAGCA AGCGGCAAAG ATGTAACTAT TGTAGGTTTT
GATGGAAATG AGGATGCAAT CAAGGCTGTA GAAGAAGGAA ATCTATCAGC AACAGTAGCC
CAAAAACCAA AAGAAATGGG TAAACTTGCA ATCGAAACAG CAATCAAATA CCTAAAAGGC
GAAGAGGTAG AAGAAACAGT AGACTCACCA CTAGAATTAA TCAAGAAATA A
 
Protein sequence
MQKTRRLLSV LLLAIFMITG CSIEGQNNKE ESKESGVEEK ASETKTDGDM KIGVSLSTLN 
NPFFVSIREG VEEAAGKENV ETVITDAQND SSTQNNQVED LITQGVDLIV INPVDSTAIA
TSVEKANEAN IPVICVDRGS DQGELVSFIA SNNVEGGKLA GEYILEKVGE NAEVIQLEGI
PGASSTRERG EGFEEATNGK INLLASQTAN FDRAEGMTVM ENLLQAHPDV KAVFCQNDEM
ALGASEAIKA SGKDVTIVGF DGNEDAIKAV EEGNLSATVA QKPKEMGKLA IETAIKYLKG
EEVEETVDSP LELIKK