Gene Apre_0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0019 
Symbol 
ID8396766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp24204 
End bp25817 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content39% 
IMG OID644994356 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003151795 
Protein GI257065539 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTG ATAAAAAAGT CTTATTTAAA ATAAGAAATC TAAAGAAGTA TTTCCCTCTT 
AAGAAAAAGT CTATATTTAA TAGAGGACCT GGCGAATATG TTCACGCTAA CGAATCCATT
AGTCTAGATA TCTATGAAGG AGAGACCCTG GGTCTTGTAG GGGAGTCTGG ATGTGGTAAG
TCTACTTTCG GTAGAACCCT CCTACAAATC TACGATCAAA CGGAAGGAAC TACTCTTTAC
TATGGAAAGA CCATAGAGGA TACTGCTCCA GCCTATATGC TAGATATGAT TAAGAAAATC
CCTAGCCAAT TTCCTAAATA TCACGAAGAA ATAGACGCCC TAAATAAGAT TTATGAAGAG
CTTGAAAACG CCGATACAGA CGAAAAAAGA GCAGAGCTTA ATGAAAAGGC TATGTTTAAG
CGACGTGACC TAGAGGAAAA CTACCTAAAT ATGGTAAGGA TTGCTGGAGG TCTTCTTGCA
AGTGATGACC TAAACAAGGT AGCTACTATG CTTAAGGAAA AATACGACAT CCTTAAACAA
AGAGCCGGCA TCCTAGAAAA GCTCGAAGAA ATTGAGCAAA AGACTCAAAT GAGATCAAGA
ACTTGGGAAG AATATGATAA ATTCCTAGAA AGTGATCCTA AGTATAAGGA TTTGATCAGC
CAAAAAGATG CCAAGACTAA AGAAATCAGT GAAAAAGATC AAACCATAGA AGCCTATAGA
AAGAGTCTCG AAAATAAGGA GAGATTTACC GAGCTTGAAG CAGAGAGAGA TAATGGAATC
GACCTATCTG AGCTTAACAA CGAGGAGATG AGAGCCCTAA GGAAGGACCT ACAAATGATC
TTCCAAGACC CTTACGGATC TTTGGATACA AAGATGACTG TAGGAAATAT CATAGGCGAA
GGTGTTCTAG GCCATGAACT CTTTAAGAGC AGAAAAGAAA AAGGATACAA TGAGTATATC
AGAGAAACTA TGGAAAAGTG TGGACTTGCT CCTTACTTCC TTCATAGGTA TCCTCACCAA
TTCTCAGGTG GACAAAGGCA AAGAATTGGT ATAGCAAGAG CCCTAGCCCT TAAGCCAAGC
TTTATAGTTT GTGATGAGGC TGTAAGTGCC CTTGACGTAT CTATCCAATC TCAGATAATA
AACCTTCTTC AAGACCTAAA AGATGAGAAC AACCTAACCT ATCTTTTCAT CACCCACGAC
CTATCGGTTG TAAAATATAT ATCAGATAGG ATTGGGGTAA TGTATCTTGG AGTTTTGGTA
GAGCTTTGTG AATCAGAGAG AATTTTCGAA AATCCTCTCC ATCCATATAC CAAGGCCCTC
CTACGTGCTA TACCAAGAAC AGACGTAGAC CAAGGCCAAG AGCTACAAGT CATCGAAGGA
GATATACCAT CAGCAGTAAA ACCTCCAAAA GGATGTAGAT TCCATACAAG ATGCGAGTAC
TCTATGGATA TCTGTGCCAA CTTCGAGCCA GAACTTAAGG AAAGAGAAGA TGGCCACTTC
GTAGCATGCC ACTTACTTGA TGTAAGCGAA GAAGAAAAAC AAAAGGCCTT TGAGAAAAAT
AAAATAGAAA AAGCTAAAAA GGAAGAAGAG CTAGAAGAAA TGAGCGCTAT ATGA
 
Protein sequence
MNSDKKVLFK IRNLKKYFPL KKKSIFNRGP GEYVHANESI SLDIYEGETL GLVGESGCGK 
STFGRTLLQI YDQTEGTTLY YGKTIEDTAP AYMLDMIKKI PSQFPKYHEE IDALNKIYEE
LENADTDEKR AELNEKAMFK RRDLEENYLN MVRIAGGLLA SDDLNKVATM LKEKYDILKQ
RAGILEKLEE IEQKTQMRSR TWEEYDKFLE SDPKYKDLIS QKDAKTKEIS EKDQTIEAYR
KSLENKERFT ELEAERDNGI DLSELNNEEM RALRKDLQMI FQDPYGSLDT KMTVGNIIGE
GVLGHELFKS RKEKGYNEYI RETMEKCGLA PYFLHRYPHQ FSGGQRQRIG IARALALKPS
FIVCDEAVSA LDVSIQSQII NLLQDLKDEN NLTYLFITHD LSVVKYISDR IGVMYLGVLV
ELCESERIFE NPLHPYTKAL LRAIPRTDVD QGQELQVIEG DIPSAVKPPK GCRFHTRCEY
SMDICANFEP ELKEREDGHF VACHLLDVSE EEKQKAFEKN KIEKAKKEEE LEEMSAI