Gene Apre_0383 details

Gene Information

Locus tag: Apre_0383
Symbol:
ID: 8397157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 433346
End bp: 435208
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 42%
IMG OID: 644994741
Product: PTS system, fructose subfamily, IIC subunit
Protein accession: YP_003152153
Protein GI: 257065897
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1299] Phosphotransferase system, fructose-specific IIC component
TIGRFAM ID: [TIGR00829] PTS system, fructose-specific, IIB component
            [TIGR00848] PTS system, fructose subfamily, IIA component
            [TIGR01427] PTS system, fructose subfamily, IIC component
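
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based inclusive positions and the protein length excludes the terminal stop codon. A minimal Python sketch of that check follows; both conventions are assumptions, and the variable names are illustrative rather than part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 433346
end_bp = 435208

# Assumption: coordinates are 1-based and inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script reproduces the listed values: 1863 bp, a length divisible by three, and 620 aa.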


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00944668
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAAATTA GAGATTTACT CAAGCCTGAA CTGATGATAT TTGACCTTCA GGCAAATGAC 
AAGATGAGTG CAATTGAAGA AATAGCCTCA AAGTTTTTCG AAAAAGGTTA TGTAAAAGAT
AAAGAAGACT TCAAAAATGG ACTTATAGCA AGAGAAGAAG AAGGATCAAC TGCCCTAGGT
GAGTCAGTAG CCATCCCTCA CACCAAAAAC GAAACTGTTA AGGAACCTGC AGTTTTATTT
GCAAGAAAAG TAGGAGGACT TGACTATGAA GCCTTAGACG GAGAGCCAAC AGAAATATTT
TTCGCCATAG GAGCACCAGC GGGAGAAAAC AACCTCCATG TAGAAACCCT AGCCGAGCTT
TCAAAAATGA TTATGAAAGA AGGCTTCATC GATGATCTTA AGAAATGTTC TAGCGAAGAA
GAAGTCTACG GAGTAATCGA CAAATACTCA GAGAAAAAGA AGGCCCCAGT AGTTGAAGAA
ACTACAAATA GCGATATCAA ATTACTAGCA GTAACAGCTT GTCCAAACGG TATAGCCCAC
ACCTACATGG CCCAAGAAGC CCTAGAAAAG GCAGCCAAAA AGGCAGGGGT TTCAATCAAG
GTCGAGACAA ATGGGTCTGA TGGAATAAAA AATAGACTAA CAGCCAAGGA AATCGAAGAA
GCTGATGCTA TAATCATTGC AGCAGATAAG AAAGTCGAAA CAAATAGGTT CGATGGCAAA
AGACTTATCC AAAGACCAGT ATCTGACGGA ATCAGAAAAA GTGATGAACT AATCGAAAAG
GCCATCAAGG GCGAAGGAAG AATCTTTACC GCAGAAGAAG GAGCAAGTAA GGGAAGTGAT
GACGATGGCG AAGGTCAAAG CTTCTGGCAA AAGATTTACG GAGACCTAAT GAATGGAATT
AGCCACATGC TACCATTTGT AATCGGTGGT GGAATCCTCA TGGCAATCTC CTTCCTAGTA
GAAAGATTCG CAGGAGATGA ATCCCTTGCC TTCACTTTCC TAAATGGTCT TGGAGGGGAT
GCCTTTAGCT TCCTAATACC AATCCTTGCA GGCTTTATTG CCATGTCAAT TGGAGATAGA
CCAGCCCTAA TGCCTGGTAT GGTTGCAGGA CTTATGGCAA GCCGTGGAGC AGGCTTTATC
GGTGGACTAA TCGGAGGTTT CCTTGCAGGT TATGTAGTTA ATTTACTAAA GAAAGCCTTC
AGAAATCTTC CAAAATCAGT CGAAGGCCTA AAGCCAATGC TAATCTATCC AGTATTTGGT
CTTTTAATCG TTGGAGCCTT GATGTTCTTT ATCATAGACC CAATATTTAC TGGAATCAAT
ACCTTCATTA ACAACTGGTT AATGAGCCTA TCTGGAGCTA ATATGCTTCT TCTAGGAGCA
ATCCTTGCAG GCATGATGGC AATAGATATG GGTGGTCCTA TCAACAAGGC AGCTTACGCC
TTCGCAATCG GAGCCTTCAC AGATACAGGA ATAGGAACCT TTATGGCAGC TGTAATGGTT
GGAGGAATGG TTCCACCAAT TGCAATAGCA ATAGCAACAA CATTTTTCAA AGATAAATTT
ACAGAAGATC AAAAGAAGAC TACAATTACC AACTACATCT TGGGTCTAAG CTTTATAACA
GAAGGAGCAA TTCCTTTCGC AGCTGCGGAA CCAACTAAGG TAATCCCAGC TAGTGTTATA
GGATCAGCTA TAGCAGGAGC AATAGTTGGA GGCTTTAACA TATCAGCCCC AGCCCCACAC
GGAGGAATCT TCGTATTGCC AGCTATGTCA AGCCTAAGCC AAGCCCTAAT CTTTGTAGGA
TCTGTATTAG TAGGTGCAAT AGTTGGTGGT CTAATCTACG GATTTATCAA AAAGAAAGAT
TAA
 
Protein sequence
MEIRDLLKPE LMIFDLQAND KMSAIEEIAS KFFEKGYVKD KEDFKNGLIA REEEGSTALG 
ESVAIPHTKN ETVKEPAVLF ARKVGGLDYE ALDGEPTEIF FAIGAPAGEN NLHVETLAEL
SKMIMKEGFI DDLKKCSSEE EVYGVIDKYS EKKKAPVVEE TTNSDIKLLA VTACPNGIAH
TYMAQEALEK AAKKAGVSIK VETNGSDGIK NRLTAKEIEE ADAIIIAADK KVETNRFDGK
RLIQRPVSDG IRKSDELIEK AIKGEGRIFT AEEGASKGSD DDGEGQSFWQ KIYGDLMNGI
SHMLPFVIGG GILMAISFLV ERFAGDESLA FTFLNGLGGD AFSFLIPILA GFIAMSIGDR
PALMPGMVAG LMASRGAGFI GGLIGGFLAG YVVNLLKKAF RNLPKSVEGL KPMLIYPVFG
LLIVGALMFF IIDPIFTGIN TFINNWLMSL SGANMLLLGA ILAGMMAIDM GGPINKAAYA
FAIGAFTDTG IGTFMAAVMV GGMVPPIAIA IATTFFKDKF TEDQKKTTIT NYILGLSFIT
EGAIPFAAAE PTKVIPASVI GSAIAGAIVG GFNISAPAPH GGIFVLPAMS SLSQALIFVG
SVLVGAIVGG LIYGFIKKKD
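
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can serve as an initiation codon and is then read as methionine. The sketch below translates only the first 30 bases of the gene sequence. The codon table is deliberately partial, and the helper name `translate_prefix` and the short list of start codons are illustrative assumptions, not a full implementation of table 11.

```python
# First 30 bases of the gene sequence and first 10 residues of the protein,
# copied from the record above (spaces removed).
gene_prefix = "GTGGAAATTAGAGATTTACTCAAGCCTGAA"
protein_prefix = "MEIRDLLKPE"

# Partial codon table: only the codons that occur in gene_prefix.
CODONS = {
    "GTG": "V", "GAA": "E", "ATT": "I", "AGA": "R", "GAT": "D",
    "TTA": "L", "CTC": "L", "AAG": "K", "CCT": "P",
}

# Assumption: a short list of common bacterial start codons, not the full
# set of initiators defined for table 11.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_prefix(dna):
    """Translate dna codon by codon, reading an initiating start codon as M."""
    usable = len(dna) - len(dna) % 3          # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODONS[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"                     # initiator codon is read as Met
    return "".join(residues)

print(translate_prefix(gene_prefix))
```

The translated prefix matches the first ten residues of the protein sequence (MEIRDLLKPE). Translating the whole gene would need the complete codon table, for example from an established sequence-analysis library.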