Gene Apre_1756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1756 
Symbol 
ID8368671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013164 
Strand
Start bp14825 
End bp16915 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content25% 
IMG OID644984687 
Productprotein of unknown function DUF214 
Protein accessionYP_003142338 
Protein GI256821139 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00569869 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAA GAATATCTAA TTTAAGTCTA AAAGATAAAG TAGATAAATA TAAGACTTAT 
TTCATATCAA TTATTGTTTG TGCGACTTTA TTTTTTTCAT TTCTTTCTAT AGCTGATTCT
AAAAGTGTGA TAATGTCAAA TGATTTATAT GATTTTTCAT ATTTTGAAAC AATAATAAGG
TGGTTAATAT ATATAGCATC AGCAGGGATA TTTTTGATAT TAAATTTCGT AAATGGAAAT
ATTTATAAAA TTAGAATTAA AAGCATATCA GTATTAAATA TTATGGGAGC AAAAAGATCT
ACCTTGGCTA AAATACTTGC TATGGAGATG CTACCTATTT ATTGTATAGG TGTTTTAATT
GGAATTTGTG TAGGTCAAAT CACTTCTCAA TTTATAAATG CTTTTATAGC AAATGCAATT
TTAGAAAAAT ATTCCATAAA TTTAGTATTT TATGCTAGTA CCCTTCTTAG AACTATTATT
TATTTTGCTT TGATATTTCT TATAACAAAT CTTTTTAATG CTTTTAAGAT AATTAAAAAG
AAACCAATAG ATTTAATAAA TGATGGGAAA AATATTAATA AAAAGACTGT ATCAAAAACT
AAGTTAATAA TATCAAGCCT CATCTTCATT ATCTGCTTAG TATATATATC ATATAATATA
TACACTTATT TTAATTTAGA TAGAGACTTT ACAGGTCCAA TACCAAATTA CGAAAGCAAT
AAAGTTCAAG CTAGTCTTTT AATAGCGTCT ATACTACTTA TGTATTCATT TTTCTATACT
ATTATTTATT TTTTAGATAA GAAACGTAAA AATGAAAGCA TATATCAAAA AGATAAGCTC
CTCGTATCAT CTAATATTCA AATGAATATA TTTGAAAATG TTAGGTTATT AGTAACTATA
GTTATATTCT TGGTTATTTC TATTTTGGCT TTTTCATTGC CTAGAATCAT GTCTACGATT
GGGGAAGAGA ATTATCATAA CAGGATGAAA AATGATATAT ATGTTCTCAC TGATTTTTAT
GTCCTGGACA GTCAAAAAGA TGTTATAGAG AATGATTATT CTTTTATTGA AAATTTAGTA
GAAGAGAAGG GGGCAAATTT AGAAGGAAGC GTAGAACTTA AATATTTTTA TCCAAGAAAA
GAAGATTTTA AACCTTATAG TAAGAAGAAT AATAGATATG ATGAATCTCG TTTAGCTATA
TCTTTAAGTC AATACAATGA ATTAAGGAAA ATGCAAAACT TAGATGAAAT AGAACTTAAA
GACAATGAAT TTGCCTACCA ATTAGCAAAT ACTGAAGACA TAGACCAGTA CAAAGATAAC
TTGATAAAAA ATAGAAAATT AAAATTAGAT ACAGTAGAGC TAGTTGGTAA AGGACAAGGA
CTTGATTATG TTTACACTGA AAATCTAGGA ACTTATATTT ATGGTGGTAT AAACAAGGAT
TTGTTAATCT TTCCAGATAA AGTAACAGAA AATCTCGATC TTGCAAAAAT TAACTTTGTT
GGCAATACTG ATGGAGGGTT AGATTATAAA AGTGCAAAAG AATTAGAAAA TGAAATAGAA
ACGACTGTTA ATAAACAGTT TACAAATTTG GAAGATAAAT ATAAAGATGA GTTAGAAAAA
GAAAATTCAG AAGGATTTTT ACAAGTTATT AGGCTTGATG AAATAGAAAA AGTGGACACT
AAATTTATGT CACTTTTGAC TTTAGTATTG GGAACATATA TTGGATTTAT ATTTACAATC
ATTGTAATGA CAATTCTATT AATACAGTCC CTGATAAATA CCAAAAACTC ACTAGAAAAC
TATAAAATGT TAGAAATTCT TGGTTTAGAT AAGAATAGGG TTTTAAGTAT AAATAACAAA
ATCACACGCT CTTTCTCAAT GATACCACTT ATTATTTCTA TTTTTATAAT AGTATCTATA
CTAATATCTA TCTACATTCG ATTTAAAAAT AGGATTAAGC TTTTTATTGG ATTAGATAAC
TATATTTACT CGCTTTTAAT AAGTATTTGT CTTGTATTAG TTTTCATAGT TCTATATATT
TTTATTATAA TTAGAGAAAA TAAGAAGCTA GTAAATCAAA AAATGAGATA A
 
Protein sequence
MRIRISNLSL KDKVDKYKTY FISIIVCATL FFSFLSIADS KSVIMSNDLY DFSYFETIIR 
WLIYIASAGI FLILNFVNGN IYKIRIKSIS VLNIMGAKRS TLAKILAMEM LPIYCIGVLI
GICVGQITSQ FINAFIANAI LEKYSINLVF YASTLLRTII YFALIFLITN LFNAFKIIKK
KPIDLINDGK NINKKTVSKT KLIISSLIFI ICLVYISYNI YTYFNLDRDF TGPIPNYESN
KVQASLLIAS ILLMYSFFYT IIYFLDKKRK NESIYQKDKL LVSSNIQMNI FENVRLLVTI
VIFLVISILA FSLPRIMSTI GEENYHNRMK NDIYVLTDFY VLDSQKDVIE NDYSFIENLV
EEKGANLEGS VELKYFYPRK EDFKPYSKKN NRYDESRLAI SLSQYNELRK MQNLDEIELK
DNEFAYQLAN TEDIDQYKDN LIKNRKLKLD TVELVGKGQG LDYVYTENLG TYIYGGINKD
LLIFPDKVTE NLDLAKINFV GNTDGGLDYK SAKELENEIE TTVNKQFTNL EDKYKDELEK
ENSEGFLQVI RLDEIEKVDT KFMSLLTLVL GTYIGFIFTI IVMTILLIQS LINTKNSLEN
YKMLEILGLD KNRVLSINNK ITRSFSMIPL IISIFIIVSI LISIYIRFKN RIKLFIGLDN
YIYSLLISIC LVLVFIVLYI FIIIRENKKL VNQKMR