Gene Apre_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0119 
Symbol 
ID8396870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp141386 
End bp143089 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content39% 
IMG OID644994457 
Producthypothetical protein 
Protein accessionYP_003151892 
Protein GI257065636 
COG category[S] Function unknown 
COG ID[COG4907] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00804511 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA AAAGACTAAG CAAATATCTT CTAGCCCTTA TCCTTTTTCT ACTTCCTAGA 
ATCTCACTTG CCGATAGTTT CGATTCACTA GACATGGACA TTACGATCGA TAAGAATGGT
GTAGGAAGTG TAGAAGAAGT TTGGCAAATA GATGAAGACG AAAGAGATTA TACCGAAAGA
TACAAATTAA TAGAAAATTT AAGAGGAATA AAAATAGAAG ACTTCTCCCT AACTTCTCCT
TCCCTAGGCA AGGACTTTAG TGAAATGGAT CCTTGGGATT CAAACCTTTC TTTTGAAGAA
AAAGCCTATA AATATGGAAG AAGCGATAGG GAGGATGAGA CAGAACTCAT TTGGGGGATA
TTCCAATACG GAAAAAACAC CTACAAGCTA ACTTACAAGA TAAATCCTCT AATAATAGGC
CTTGAAGATA GCGATATGCT CTTTTTCCAA TTTGTCGGAG AAAACTTTGA CCCCAAGCCA
GAAAGGGTCA ATATCAATAT TAAAGGCTTT GAGCCCTTCG ATCGAAATGT AAAGATGTGG
GGCTTTGGCC TGGATGGAGA TATCCACAAT GCCAGTGGAA ATATAGTCCT TAAATCTACT
GGCGAAGTTG ACTATACGAC TATCATGCTA AAATTCCCTA AGGGCTATTT CAATACATCC
TATAAGGAAG ATAAAACCTT CGACGATTAT GCTAATGAGG CAATTAAGGG ATCAAAATGG
GAAGAGCGAG AAGGAGAGGC TAATACCGAC CCAACCCCTT GGTATGTCAA AGTAATCCTT
CCCCTTGTTC TTCTCTTAGG TCTAGGATCA ATCTTTCTTG GAGTGAGGGC GAAAAAGCTT
CATTTTGATG AGAATAATAT CACAAATGAT GAGACCCTCA AGAAAGCCAA GACCTTCAAA
GACCAGTACT TTAGAGATAT CCCCTATGAT TCCCATATAG AAGATACCTA TCTATTGGCA
GAGAAGGCCT ACCCTTACGA GATAAATCAG GGAAATTACA TGAACGCCTT TATCCTAAAA
TGGATTTACG ATAAAAATAT CGAAATCGAA GCTAGAAAGG AAAAGGATAA AAATGCCAAG
ATAAGAATCC TCGCAAGACC TAGTGATATG GGCAAGATGG AGGAGGCTTT CTTCAATATA
ATCGAAAAGT CACAAGAATA CTCCAAAGAC GGTCTTGTCT CAAACAAAAA CATCCAAAAA
TACTTCAAGA AAAACAAGGA AATCACAGAA GACTTCTATG AAGACTTCTT ACCTAGGTCA
AGGGATGGCC TGAAGTCTGG AGAATATCTA AAAGATATAA GATTCGAGAA GAAATTCGCT
AGGACTAGGA GAGAAGGATC CGAGCTTGAA GTTACTCCTA AGGGAGTTGA CCTTTACGAA
AATCTAATCA AGTTTAGAAA TTACCTAGAA GATTATTCCT TGATAGAGGA AAGAGATGCT
AATGAAGTTC ATATTTGGGA CTACTTCCTA ATATATGCAG CAATTTACGG CATAAGTGAT
AAAGTCTTTA AAAACCTAGA GAAGACTTAT CCAAACTATA GCAACAACTC AGTCTTTAAC
TACTATATGA TTACAAATTC AAGAAACTAC TCCACTATGA CCCAAGCCAA TATAGCAAGC
TTTACAGCAG CAGGATCTGG CGGAAGCACA TCCTTTGGTG GAGGTGGCGG AAGCTTCGGT
GGAGGTGGTG GCGGAGGTCG CTAG
 
Protein sequence
MKIKRLSKYL LALILFLLPR ISLADSFDSL DMDITIDKNG VGSVEEVWQI DEDERDYTER 
YKLIENLRGI KIEDFSLTSP SLGKDFSEMD PWDSNLSFEE KAYKYGRSDR EDETELIWGI
FQYGKNTYKL TYKINPLIIG LEDSDMLFFQ FVGENFDPKP ERVNINIKGF EPFDRNVKMW
GFGLDGDIHN ASGNIVLKST GEVDYTTIML KFPKGYFNTS YKEDKTFDDY ANEAIKGSKW
EEREGEANTD PTPWYVKVIL PLVLLLGLGS IFLGVRAKKL HFDENNITND ETLKKAKTFK
DQYFRDIPYD SHIEDTYLLA EKAYPYEINQ GNYMNAFILK WIYDKNIEIE ARKEKDKNAK
IRILARPSDM GKMEEAFFNI IEKSQEYSKD GLVSNKNIQK YFKKNKEITE DFYEDFLPRS
RDGLKSGEYL KDIRFEKKFA RTRREGSELE VTPKGVDLYE NLIKFRNYLE DYSLIEERDA
NEVHIWDYFL IYAAIYGISD KVFKNLEKTY PNYSNNSVFN YYMITNSRNY STMTQANIAS
FTAAGSGGST SFGGGGGSFG GGGGGGR