Gene Aave_1930 details


Gene Information

Locus tag: Aave_1930
Symbol:
ID: 4669105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax citrulli AAC00-1
Kingdom: Bacteria
Replicon accession: NC_008752
Strand:
Start bp: 2091588
End bp: 2094371
Gene Length: 2784 bp
Protein Length: 927 aa
Translation table: 11
GC content: 69%
IMG OID: 639823140
Product: DNA polymerase I
Protein accession: YP_970288
Protein GI: 120610610
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
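
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(2091588, 2094371)
length_aa = protein_length(length_bp)
print(length_bp, "bp")
print(length_aa, "aa")
```

With the values from this record, the computed lengths agree with the listed 2784 bp and 927 aa.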


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.173956
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0298694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAC CCAAGACCCT GGTGCTGGTG GACGGCTCCA GCTACCTCTA CCGCGCCTTC 
CACGCCATGC CGGACCTGCG GGCCGTGCCC GGCGACCCGG CCAGCCCCGC CACCGGGGCC
ATCCGCGGCA TGATCAACAT GCTGCAGGCG CTGCGCAAGG ACTATCCCGC CGACTACGCC
GCCTGCGTCT TCGACGCGAG CGGACCGACC TTCCGCGACA CGCTCTACCC CGAATACAAG
GCGCACCGCG CCCCCATGCC CGACGACCTG CGCGCCCAGA TCGAGCCCAT CCACCAGGTC
GTGCGCCTGC TGGGCTGGCC GGTGGTCTGC GTGCCGGGCG TGGAGGCCGA CGACGTGATC
GGCACCCTCG CGGCCTGCGC GGCGCGCCAG GGCATCGAGG TCATCGTCTC CAGCGGCGAC
AAGGACCTGT CGCAACTGGT GGACGAGCAC ATCACCATCA TCGACACCAT GAGCGGCAAG
CGCCGCGACG TGGCGGGGGT CACGGCCGAG TTCGGCGTGC CGCCCTCGCT CATGGTCGAT
TACCAGACGC TGGTGGGCGA TGCGGTGGAC AACGTGCCCG GGGTGCCCAA GGTGGGCCCC
AAGACGGCTG CCAAGTGGCT GAACGAATAC GGTTCGCTCG ACGGGCTGCT GGCGCGCGCG
GGCGAGATCA AGGGCGTGGC GGGCGAGAAC CTGCGCAACG CGCTCGACTG GCTGCCCAAG
GGGCGCGAGC TGGTCACCAT CCGCACCGAT TGCGACCTGG CCGGGCATGT GGAAGGCCTG
CCGGCGCTGG ACGGCATCGC GGCCGGCGCG CTGGAGACGG CGCAGCTCCG CGAGTTCTAC
GAGAAGTTCG GCTTCAAGGG GCTGGCCCGT GCGCTGGAAG ACACCACGCC CGCCAATGCC
ACGGCCACGC TACCCGGTGC GAGCGGCGAC CTGTTCGCCG ACAGCGGCAC GACCTTCGTC
GCCGAGGCCG CACAGGGGCG CGAAGTGGCC TACGACACGA TCCTCGCTTG GGAGGAGTTC
GACCGCTGGT TCGCCCGCAT CCAGGCGGCC GACCTGGTCG CGCTCGATAC CGAGACCAAC
TCGCTCGACG AACTGCGCGC CCAGATCGTG GGCCTGAGCT TCAGCGTGGA GCCCGGTGCC
GCCGCCTACA TTCCGCTCGC GCATTCCGGC CCCGAGGCGC CCGAACAACT GCCGCTCGAC
ACCGTGCTGG AGCGCCTGAA GCCCTGGCTG GAGGATGCTT CCCGGCCCAA GCTGGGCCAG
CACGTCAAGT ACGACCGCCA CGTGTTCGCC AACCACGGCA TCGATGTACG CGGCTATGCG
CACGACACCA TGCTGCAGAG CTACGTGCTG GAAGTGCACC GGCCGCACAA CCTCGCGAGC
CTGGCCGAGC GCCACACGGG CCGCACCGGC ATCAGCTACG AAGACCTCTG CGGCAAGGGC
GCCAAGCAGA TCCCGTTCGC GCAGGTGCCG GTCGAGCAGG CCGCGGCCTA TTCCTGCGAA
GACTCGGACC AGACGCTCGA CGTGCACCGC GTGCTCTGGC CGCTGATCGA GGCCGACGAG
AAACTGCGCG CCATCTACGC CCTCGAGATG GAGAGCAGCG AGGTGCTGTT CCGCATCGAG
CGCAACGGCG TGCTGATCGA TACGGCCACG CTGCAGCAGC AGAGCCACGA CCTCGGGCAG
CGCATCCTCA GGCTCGAGCA GGAGGCCTAC GACATCGCCG GCCAGCCGTT CAACCTCGGC
AGCCCCAAGC AGCTCGGCGA GATCTTCTTC GACAAGCTGG GCCTGCCCGT GGTGAAGAAG
ACCGCGACCG GCGCGCGGAG CACCGACGAG GAGGTCCTGG AAAAACTCGC CGAGGACTAC
CCGCTGCCCG CCAAGATCCT GGAGCACCGC AGCCTCTCCA AGCTCAAGGG CACCTACACC
GACAAGCTGG GCCAGCTGGC CGATCCGCGC ACGGGCCGGG TGCATACGCA CTACGCGCAG
GCCGTGGCGG TGACGGGGCG CCTGTCGAGC AACGAGCCCA ACCTGCAGAA CATCCCCATC
CGCACGGCCG AAGGCCGGCG CGTGCGCGAG GCCTTCGTGG CGCCGCCGGG CCGCGTGATC
GCCAGCGCCG ACTACAGCCA GATCGAGCTG CGCATCATGG CCCACCTGAG CGGCGACGAC
GCGCTGCTGC GCGCCTTCAC CGAGGGGCTG GACGTGCACC GCGCCACCGC GGCCGAGGTG
TTCGGCGTGG CGGTGGACCA GGTGAGCAGC GAGCAGCGCC GCTATGCCAA GGTGATCAAC
TTCGGGCTCA TCTACGGCAT GAGCAGCTTC GGCCTCGCGC GCAACCTGGG CATCGACAAC
AAGGCCGCCG CGGCCTACAT CGACCGCTAT TTCCAGCGCT ACCCCGGCGT GAAGCAGTAC
ATGGACGAAA CCAAGGCCTC GGCCAAGGCG CGCGGCTATG TCGAGACGGT GTTCGGCCGG
CGGCTCTACC TGCCCGAGAT CAATTCGCCC AACGGCCCGC GGCGCGGCGC CGCAGAGCGC
GCCGCCATCA ACGCGCCCAT GCAGGGCACG GCGGCGGACC TCATCAAGAA GGCGATGGTG
GCCGTGCAGG CGGTGCTGGA CGCGGACAAG CCGCAGGTGC TGATGATCAT GCAGGTGCAC
GACGAACTGG TGTTCGAACT GCCCGAAGAG GAATCCGGCT GGCTGCGCAC CGAAGTGCCG
CGCCTCATGG CCGGCGTGGC GGAGCTCAAG GTGCCGCTGC TGGCGGAAGT GGGCATGGGG
CCGAACTGGG AGAAAGCGCA CTGA
 
Protein sequence
MSKPKTLVLV DGSSYLYRAF HAMPDLRAVP GDPASPATGA IRGMINMLQA LRKDYPADYA 
ACVFDASGPT FRDTLYPEYK AHRAPMPDDL RAQIEPIHQV VRLLGWPVVC VPGVEADDVI
GTLAACAARQ GIEVIVSSGD KDLSQLVDEH ITIIDTMSGK RRDVAGVTAE FGVPPSLMVD
YQTLVGDAVD NVPGVPKVGP KTAAKWLNEY GSLDGLLARA GEIKGVAGEN LRNALDWLPK
GRELVTIRTD CDLAGHVEGL PALDGIAAGA LETAQLREFY EKFGFKGLAR ALEDTTPANA
TATLPGASGD LFADSGTTFV AEAAQGREVA YDTILAWEEF DRWFARIQAA DLVALDTETN
SLDELRAQIV GLSFSVEPGA AAYIPLAHSG PEAPEQLPLD TVLERLKPWL EDASRPKLGQ
HVKYDRHVFA NHGIDVRGYA HDTMLQSYVL EVHRPHNLAS LAERHTGRTG ISYEDLCGKG
AKQIPFAQVP VEQAAAYSCE DSDQTLDVHR VLWPLIEADE KLRAIYALEM ESSEVLFRIE
RNGVLIDTAT LQQQSHDLGQ RILRLEQEAY DIAGQPFNLG SPKQLGEIFF DKLGLPVVKK
TATGARSTDE EVLEKLAEDY PLPAKILEHR SLSKLKGTYT DKLGQLADPR TGRVHTHYAQ
AVAVTGRLSS NEPNLQNIPI RTAEGRRVRE AFVAPPGRVI ASADYSQIEL RIMAHLSGDD
ALLRAFTEGL DVHRATAAEV FGVAVDQVSS EQRRYAKVIN FGLIYGMSSF GLARNLGIDN
KAAAAYIDRY FQRYPGVKQY MDETKASAKA RGYVETVFGR RLYLPEINSP NGPRRGAAER
AAINAPMQGT AADLIKKAMV AVQAVLDADK PQVLMIMQVH DELVFELPEE ESGWLRTEVP
RLMAGVAELK VPLLAEVGMG PNWEKAH