Gene Apre_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1039 
Symbol 
ID8397826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1107376 
End bp1109367 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content34% 
IMG OID644995387 
Producthelicase domain protein 
Protein accessionYP_003152788 
Protein GI257066532 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1200] RecG-like helicase 
TIGRFAM ID[TIGR00643] ATP-dependent DNA helicase RecG 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTAA CGTCTCTAAA GGGCATAGGA GATAAGAAGA GCAAGGACTT TAAGAGGCTT 
GGTATATCTT CTGTATCAGA TCTTTATAAT TACTATCCTC GAGAATATGA GGACAGGAGC
AAGTTAAAAA GCATAGTCGA TATAGATGAT AACAACAAAC ATTATTTTGT CCGGAAAATT
TCTAGCAGAC TGTACCAGAA GAATTTCGGT AAAATGACTA TTTCCTATAT CTATGCCTTT
GAAGAAAAAT CGCAATTTAG AAATATAAGA CTAATATGGT TTAATGATAG ATTTACCCCA
AGAAGGCTTG TTAGAGGAAG GACATATAAG TTTTATACCT CAGTTTCTAA GAAGAATGCT
TTTTATGAAG CTGGGAATCC TTTATTTTGT GAAATGGATG ATGATGAAAT AGGCTCTATT
GCATCGATTT ACCCTCTTAC AAAGGGAATT AGTAATAAAC AAATCAGACA ATTTATGAGT
AGGGCTTTGG CTTACTTTGA TAGGAGAGAA GAAATTTTGT CCGATACTAT ATTAGAAGGA
TTCTCGCTAA ACAAAAGATA CGACAATCTT AAAGAGATTC ATTTTCCCAC GTCAGTCGAA
GGCTTAACCA AGGCAAAGAG TCAGATCAAA ATAGTGGACT TATTAAAAGA CTTATGCTTC
TTGGACTTTC TCAAAAGTAA GACTAAGTTT AGACAAGATA TTGATCTTGC TTACAAGCTA
GATGAGATTT TATCTGAGCT TTCTTTCACA CTCACCAGAA GTCAAAGAAG GGTTTTGGAA
GAAATTTTGG ATGATTGTAA GAGTCCTTAC ACAGCCAATA GACTCTTGGT TGGAGATGTA
GGAAGTGGTA AGACAATAGT TGCTATAGTA ATTATGATAA TTTTTGCCCT AAATGGCTAT
CAATCAGCTA TGATGGTACC TACTGAACTT TTGGCTATTC AGCAATTCGA AAAAAATATC
GAGCTGTTTG ATAAGTTTAA TGTAAGAGCA GCCTTACTTA CAGGAAGTAG CAAGGATAAG
GATAAACTTA AAGAGGACCT TAAAAATGGT AAAATTGACA TCCTTATTGG AACTCATGCA
ATCATAGTTG AAGATGTTGA CTTTAATAAT TTGAAGTTTG TCGTAAATGA TGAGCAACAC
AGATTTGGTG TAAGTCAAAG GCAAATGCTG GCCCTAAAGG GCGATGCTGT AAATTATCTT
ACAATGACAG CAACCCCTAT CCCTAGGACC CTGTCTCTTA AAATCTCAGA AATTATCGAC
TTATCTATAA TAAACGAACT TCCCAAGGGA AGAAAGCCTA TAATGACAAG GCTTTTGGGA
TCTGATAAGA TTGAAATCTT ATATGAGAAG ATAAATCAAA CTATAAGAGA AGGCAGGCAA
ATATATGTAG TTAGCAATAA TATTGACTCC GATGATAAAA ATTCTTTGGA AAATTTATAT
AAAATCTATA AGAATAGATT TCCACGATAT AGGATAGCTA TCCTTCATGG CAAGATGAAA
GCTAAGGATA AGGAAGATAT CTTAGGAGAT TTCAATAAGG GTAAAATTGA TATATTACTA
GCAACTACAG TTATTGAAGT AGGAATAGAT GTGGCCAATG CTAATACCAT GATAATATAT
AATGCCAATA ATTTCGGTCT TTCAACCCTC CACCAGCTTA GGGGAAGAGT AGGTAGGGGT
GAGTATGAAT CATACTGCTA TCTAATTTCT GATAATCCCT CTCCATCAAA TAAATTAAAT
GTCTTAGTAG AAAGTAATGA TGGATTCGAA ATAGCTAAGA AAGACTATGA ACAAAGAGGT
GGCGGCAAAA TCTTATCTTA TTTACAACAT GGAAAAAATC TTTCGACTGT AGATTTTCTT
AATATGAGTA AAGATGAAAT TGAAAAATGT TTTAAGATAT ATGAGAAGCT CAAAGATGGT
AACTACGAAG GAATCGACCT TACATATGTC GAAGAATATT TTAGAGAGAA TAAAAGGATT
ATTTTAAACT AA
 
Protein sequence
MDLTSLKGIG DKKSKDFKRL GISSVSDLYN YYPREYEDRS KLKSIVDIDD NNKHYFVRKI 
SSRLYQKNFG KMTISYIYAF EEKSQFRNIR LIWFNDRFTP RRLVRGRTYK FYTSVSKKNA
FYEAGNPLFC EMDDDEIGSI ASIYPLTKGI SNKQIRQFMS RALAYFDRRE EILSDTILEG
FSLNKRYDNL KEIHFPTSVE GLTKAKSQIK IVDLLKDLCF LDFLKSKTKF RQDIDLAYKL
DEILSELSFT LTRSQRRVLE EILDDCKSPY TANRLLVGDV GSGKTIVAIV IMIIFALNGY
QSAMMVPTEL LAIQQFEKNI ELFDKFNVRA ALLTGSSKDK DKLKEDLKNG KIDILIGTHA
IIVEDVDFNN LKFVVNDEQH RFGVSQRQML ALKGDAVNYL TMTATPIPRT LSLKISEIID
LSIINELPKG RKPIMTRLLG SDKIEILYEK INQTIREGRQ IYVVSNNIDS DDKNSLENLY
KIYKNRFPRY RIAILHGKMK AKDKEDILGD FNKGKIDILL ATTVIEVGID VANANTMIIY
NANNFGLSTL HQLRGRVGRG EYESYCYLIS DNPSPSNKLN VLVESNDGFE IAKKDYEQRG
GGKILSYLQH GKNLSTVDFL NMSKDEIEKC FKIYEKLKDG NYEGIDLTYV EEYFRENKRI
ILN