Gene Apre_0131 details


Gene Information

Locus tag: Apre_0131
Symbol:
ID: 8396882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 155387
End bp: 157417
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 38%
IMG OID: 644994469
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_003151904
Protein GI: 257065648
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
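
The start, end, gene length and protein length values above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the stop codon lies inside the coding span but is not counted in the protein length. The function name cds_lengths is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon that
    does not contribute a residue.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


print(cds_lengths(155387, 157417))  # -> (2031, 676)
```

Under these assumptions the result agrees with the listed Gene Length (2031 bp) and Protein Length (676 aa).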


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATACG AAAAAAACCT TATAGATGAT ATTGGTGAGA TTCGTCCTAG AAAGAACGAG 
TATATAATAG ACTCTATTAA GGCAGAAGAA AGTGGCGGAA ATAAAGTAGA TGCCAAAAAC
CATCCTTATA ATCAAAAGCT AAAAAAGTAC GAAGAAGAAA AGAAAGAACT TTTGGCTAAG
GCCGATGAAG CTGCCAAGAA AGATCCAAAT TACAAACTCG ACCAGAAATA CTTGAGGGAC
TTGTATTATA GGAAGTTTAT GGCGAACTAC TGCCTGGACT TTTATGAAAA AAACAAGGAC
CTATCCTATG ATAGCGAGCT TGATTATAAG TTATGTAAGT TAGAATATGA GCAAATACCA
AAGATTATAG AACATGATTT GCTCCTAAAA AGTCAGCTAG AAAGAGCTAG CAAGAGGCTA
GATAAGCTAA CTAGAGAAGA GATAGAAAGC GCTAAAAAGC TAAGCGAAGA AGATAGACTT
GTTCTTAAGG AAAAATTTGA ATCAGATAAC AAGAGCCTAG AAGAATCTTT CAACAAGGGA
AGAATTTCCA AAAAGGCCTT CAAGAGCGAG AAAGAACAAC TAAAACAAAA GTTCAAAGAC
CAGAACAAGA GACTTAATTA CAGAAATCCA GAAGTTTCTC TTAAGGAAGA AATTGAATCA
ATCAAATACA AGATAGACAA AGACTACAAG AAGGAGATGA AAATCCTTGA GGCAGATGAG
GCTGAAGCTA GGAGGAGAAC TCCAGTAGAG GTAGAGAAAA CTTCTGCCTA TAGGTCAATC
CTTACCTTCC CAATCCCTGG TCTTGGTCAA ATCCTAAACG GCCAGTGGCA AAAGGGCTTA
TTATTTTTAC TAGGTACTTT ATTCATCTAC CTAATAGCAA TCCCTTACGC CCTAGGTTTT
GGTAACTACC AAGGTGATGG TGTAGCAGGA CTTATCTCTC TTGCCCAAGG CGGAAAGAGA
CTTGATAGAT CAATACTCTT TATGATTGAG GGTATCTTAT CTATAGTTTT TGTAGCAATT
GCTGCCCTAA TCTACATCCT ATCCTTTAAG GATGTTAGGA CAACTGAGAA AAATGAAATG
AAGGGTATCA GACCTAACAA CTTCTTTGAG ACCAAGAAGA TGTTAAGGAC TGACGGATTC
CCATTCTTAA TTACAGCACC AGCCCTAATA GTAATTGTGT TTATAGTAAT AGTTCCAATA
CTTACAGCTA TAATGATTTC CTTTACTAAC ATGGACCCAC AACATCAAAA CAAGTTCACT
TGGGTTGGCC TAAACAACTA CTTGACCATA GCTAAGGGCC AAGGTATAGC AGGACAAGCC
TTCTGGCATA TCTTCGGATG GACTGTGGTA TGGACCTTAC TTGCATCAAC ACTTGCAATA
GTCCTAGGCT TTATCTTTGC TCTTCTAGTA AACAACGAGA GGGTTAAGGG AAAGAAGTTC
TTTAGAACGG TTTATCTACT TCCTTGGGCT ATACCTGCCT TTATCACAAT CATGTTCTTT
TCAATAATGA CAAGTCGTGG CGGGGTAATA GCAAATGCCC TAAACTCTTT GTTTAATATA
AGCCTAGATA TAAAGAACAA CACCTTCCAG ACTAGGGCCA CCTTAATTTT GCTCCAAGGT
TGGCTAGGAC ATTCTTATAT CTTCTTACTA ACAACAGGAG TGCTCCAGGC TATACCAAAA
GATTTATATG AAGCAGCAAG TATTGACGGG GCGACAGGAA TCCAAAGAAC CTTTAAGATT
ACAATTCCTT TAGTTCTTTT CCAAATAGCT CCAATGCTTA TTAACCAATA TACCTTTAAC
TTTAACAACT TCTCAATCAT CTACTTGTAC AACCAAGGTG GACCATTCAA CCCAGAAGTT
TATGGTAACC TTGCGGGAAG CTCAGATATT TTGATCTCCT ACATCTACAA GCTAACGATG
GAGAGTCAGT ACCAAGCTAT AGGTGCTGCA ATAACAGTAT TTATATCCAT AATCCTAATA
GTTATTTCAT ACTTTGGATA TAAGAATTCT TCAGCTTTTA AGGAGTATTA A
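
The gene sequence is listed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to compute a GC fraction from text in that layout. The function name gc_fraction is hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def gc_fraction(blocked_sequence):
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence listed above.
print(gc_fraction("ATGACATACG AAAAAAACCT"))  # -> 0.3
```

Applied to the full gene sequence, the result can be compared with the GC content value listed under Gene Information.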
 
Protein sequence
MTYEKNLIDD IGEIRPRKNE YIIDSIKAEE SGGNKVDAKN HPYNQKLKKY EEEKKELLAK 
ADEAAKKDPN YKLDQKYLRD LYYRKFMANY CLDFYEKNKD LSYDSELDYK LCKLEYEQIP
KIIEHDLLLK SQLERASKRL DKLTREEIES AKKLSEEDRL VLKEKFESDN KSLEESFNKG
RISKKAFKSE KEQLKQKFKD QNKRLNYRNP EVSLKEEIES IKYKIDKDYK KEMKILEADE
AEARRRTPVE VEKTSAYRSI LTFPIPGLGQ ILNGQWQKGL LFLLGTLFIY LIAIPYALGF
GNYQGDGVAG LISLAQGGKR LDRSILFMIE GILSIVFVAI AALIYILSFK DVRTTEKNEM
KGIRPNNFFE TKKMLRTDGF PFLITAPALI VIVFIVIVPI LTAIMISFTN MDPQHQNKFT
WVGLNNYLTI AKGQGIAGQA FWHIFGWTVV WTLLASTLAI VLGFIFALLV NNERVKGKKF
FRTVYLLPWA IPAFITIMFF SIMTSRGGVI ANALNSLFNI SLDIKNNTFQ TRATLILLQG
WLGHSYIFLL TTGVLQAIPK DLYEAASIDG ATGIQRTFKI TIPLVLFQIA PMLINQYTFN
FNNFSIIYLY NQGGPFNPEV YGNLAGSSDI LISYIYKLTM ESQYQAIGAA ITVFISIILI
VISYFGYKNS SAFKEY
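
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The record lists translation table 11, which assigns the same amino acids to codons as the standard genetic code, so a standard codon table is enough for a quick consistency check. The following is a minimal sketch; the names CODON_TABLE and translate are hypothetical, and the code does not handle ambiguous bases or alternative start codons.

```python
# Standard codon table, built from the usual compact encoding with
# bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_sequence):
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace between blocks is ignored; a trailing partial codon
    is dropped.
    """
    seq = "".join(blocked_sequence.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence listed above.
print(translate("ATGACATACG AAAAAAACCT T"))  # -> MTYEKNL
```

The output matches the first seven residues of the protein sequence (MTYEKNL). Passing the complete gene sequence should reproduce the full listing if the two sequences are consistent.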