Gene Apre_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1840 
Symbol 
ID8368747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013164 
Strand
Start bp103643 
End bp105418 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content25% 
IMG OID644984763 
ProductABC transporter related 
Protein accessionYP_003142414 
Protein GI256821215 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.301494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAGG ATAAAATAAA TCATCTTGAA TGTATAAAAA TAGCATTCAA GATGATTTAT 
GAAACAGATA AAAAGTCTTT TGCGATCATT ATTATATTGT CAATTGCAAG TGGTATTTTT
CCTTTTTTAG TACTTAGATT AGGACAGACA ATAGTTAATA TAATTCAAGT TCATTCTATT
AAATTTGATA TCATTATAAG GCTTATATAT ATATATTTAT CTTTACAATT TATATCTATA
ATAGTAGATA ATATTAAAAA TTACTATTTA CAAAGATTGA GTAATGAAGT TATTTATTCT
TCCATGAGCA AAGTAATGGG AAAATGTGCT GATTTACCTT TAAAAAAGCT TGAAGATAAT
AAAACTTATG ATATTTTGAA TAGAATAGAG CAGGATGCTA CATTAAAGCC ATATGAAATA
TTAATGGCGG TTATAGGTTT ATTTTCTAGT TTGACACAGA CTATTATAGC CATATATGTA
TTAATTAAAT GGAATTATTA TCTGGTTCTT CTTTTGTTTG TAGTTACCAT TGTATCAGTA
TTTGGAGAAA TAAGAATAGG TAAATTAGAG TTTAATATTA GAAATAGAAG GAGTAATTTA
GAGAGGAAAA GTTGGTATTA TTCTTTTTTA TTAACCCATG ATATTGCTTT TAAGGAGATC
AAAACTTTTA GGTTGAAAAA TTATTTTCTT AATAAGTATA CAGAAATAAC AGATACTATT
ATTAATCAAA ATAATAGCAT TGAAAAATTG AAAGCTATAT TAATAATTGT TATAAATTTT
ATTCAAATTC TTATTAATAT ATATATATTT AAAGAACTTG CATTTAAAAC GTATAATGGA
GACTTTTTAA TTGGAACAGC TATGATGTAT ATAAACACAA TTGCTATATT TCAAGGTTCT
CTTAATGAAA CAGGAACTAG TGTATACAAT ATTATTAATT CAAATTTATA TATAAATTTA
CTTAAAGAGT TTCTTGAATT TAAAACTGAC GATTTAAAAG AAGAGACAAA AACTATACTA
AGATCAATAA AAGACATTAA TGTAATATCG TTATCTAATA TAAACTTTTC TTATGATGAT
AATAGTTTAG CTCTGCAAGA CATTTCTTTA AAGATAAAAA AAGGAGAATC CATTGCAATA
ATTGGGGAAA ATGGTTCAGG AAAAAGTACA CTTCTAAAAA TTTTAGCTGG ATTGTATAGT
CCAGATTCAG GTGTGTTTTT AATTAATGGG ATGAAATTTG ATGATATTGA AATAGAATCT
TATAGAACAC AGATTAGTTC ATTATTTCAA GATTATTTGA AATATGAGGG GACAATAAAA
GAGAATATTA TATTAGGGCA AATTGATAGA AATGAAGACG ACTTTTCGAT TCTAACAGCA
TTAAATAGTG CTGATGCAAA ATTTTTAAAA AATGATGGTA AATATAATAT TAATAAAGTA
GTAGGTAATT GGTTTGAAAA TGGGCAGGAA CTTTCAGGCG GACAATGGCA AAAAATTGCA
ATAGCACGAA CTATGTATAG AAAATCAAGT CTATTGTTAT TTGATGAGCC AAGCTCGTCA
TTAGACATAA TTTCTGAAAA AATCATATTT GATAATATTT TAAATAATTT GAATGATAAG
ATTATTATAT ATATCACCCA TAGGATAAGA TGTGCGATGA ATTCTGATAG AATTATTGTG
ATGGATAATG GAAAAATAGT AGGTGATGGG AGTCACGATG ATTTAATTGA AAACTGTAAT
AGATATAAGT TAATGTATAA TAAGGAATTT AAGTGA
 
Protein sequence
MGKDKINHLE CIKIAFKMIY ETDKKSFAII IILSIASGIF PFLVLRLGQT IVNIIQVHSI 
KFDIIIRLIY IYLSLQFISI IVDNIKNYYL QRLSNEVIYS SMSKVMGKCA DLPLKKLEDN
KTYDILNRIE QDATLKPYEI LMAVIGLFSS LTQTIIAIYV LIKWNYYLVL LLFVVTIVSV
FGEIRIGKLE FNIRNRRSNL ERKSWYYSFL LTHDIAFKEI KTFRLKNYFL NKYTEITDTI
INQNNSIEKL KAILIIVINF IQILINIYIF KELAFKTYNG DFLIGTAMMY INTIAIFQGS
LNETGTSVYN IINSNLYINL LKEFLEFKTD DLKEETKTIL RSIKDINVIS LSNINFSYDD
NSLALQDISL KIKKGESIAI IGENGSGKST LLKILAGLYS PDSGVFLING MKFDDIEIES
YRTQISSLFQ DYLKYEGTIK ENIILGQIDR NEDDFSILTA LNSADAKFLK NDGKYNINKV
VGNWFENGQE LSGGQWQKIA IARTMYRKSS LLLFDEPSSS LDIISEKIIF DNILNNLNDK
IIIYITHRIR CAMNSDRIIV MDNGKIVGDG SHDDLIENCN RYKLMYNKEF K