Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aave_2000 |
Symbol | |
ID | 4668490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax citrulli AAC00-1 |
Kingdom | Bacteria |
Replicon accession | NC_008752 |
Strand | + |
Start bp | 2172455 |
End bp | 2175310 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639823211 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_970358 |
Protein GI | 120610680 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.277994 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000144828 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAATGCGA CGCCTTCCCG CAAGACCGAC ACCGCGCCCG AACCCCAGGC CTCGCCGTCG CCCGGACGCC GCGCCGACAA GGACCAGCCG CTGATCCAGG ACATCCGCCT GCTCGGGCGC ATCCTGGGCG ACGTGATCCG CGAGCAGGAG GGCGTGGCCG CCTACGACCT CGTGGAACAG GTGCGCAAGC TGTCGGTGGC GTTCCGCCGC GATGCGGACC AGGAGGCGGA CAAGGCGCTC AAGAAGCTGC TCAAGGGCCT GTCGGGCGAC CAGACGGTGA GCGTGATCCG CGCCTTCACG TATTTCAGCC ACCTGGCGAA CCTCGCCGAG GACCGGCACC ACATCCGCCG CCGCACCGTG CACGAGCGCG CCGGCAGCAC GCAGGAAGGC AGCATCGAGG TGGCCCTGGC CCGGCTGCGC TGGGCGGGCA TCGCGCCGCG CGCCGTGGCG CAGACGCTGG CACGCAGCTA CGTGTCGCCG GTGCTGACCG CGCACCCGAC GGAGGTGCAG CGCAAGAGCA TCCTGGACGC GGAGCGCGAC ATCGCCGAAC TGCTGGCCGA GCGCGACGAG ATCCAGGCGC GGGGGCTGTT CTACAAGTCC GCGCGCGATG CGCTCACGCC GCGCGAGCTG GCGGCCAACG AGGCGCGGCT GCGCGCCCGC GTCGCGCAAC TGTGGCAGAC CCGGTTGCTG CGCTATTCGA AGCTCACGGT GGCCGACGAG ATCGAGAACG CGCTCTCCTA TTACGAATCG ACCTTCCTGC GCGAGATCCC GCGCCTGTAC GAGGACCTGG AGCGGGCGCT GGGCAGCCAG CCGGTGGCGA GCTTCCTGCG CATGGGCCAG TGGATCGGCG GCGACCGCGA CGGCAATCCC AACGTGAGCG CCGACACGCT GGCGCTCGCA CTGCGCCGCC AGGCCGAGGT GGCGCTGCGC CACTACCTGA CCGAGGTGCA CTACCTGGGC GGCGAGCTGT CGCTGTCGGC CCGGCTCGTG CAGGTCTCTC CGGAGATGCA ACAGTTGGCC GAACGCTCGC CCGACACCAG CGAGCACCGC CAGGACGAGC CCTACCGCCG CGCGCTCACC GGCATCTATG CGCGGCTGGC GGCGACGCTG CGCGAATTGA CGGGCGGCGA GGCGGCGCGC CATGCCGTGG CGCCCCAGAA CCCGTACGGC GACGCTTCGG AGTTCCTGGC GGACCTGCGC GTCATCGAGG CCTCGCTGCG GTCGCACCGC GGGGCGCCGC TGGCGGCGCA GCGGCTGCAT CCGCTGGTGC GGGCCGTGGA GGTGTTCGGC TTCCACCTGG CCACGGTGGA CCTGCGGCAG AGCTCCGACC AGCACGAGCG CGTGGTGGCG GAGCTGCTGG CCACGGCCCG GATCGAGGCT GATTACGCGG CCCTGCCGGA AGATGCGCGC CAGGCCCTGC TGGTGCGCCT GCTGTGCGAC GCCCGGCCGC TGCGCGTGGT GGGCGCGGAA TACTCGGAGC ACACGCGCGG CGAGATCGCC ATCTTCGAGA CGGCGCGGCG CATGCGCGAC CGCTACGGCG CCCAGGCCAT CCGGCACTAC ATCATCAGCC ACACCGAGAC GGTGAGCGAC CTGCTGGAAG TGCTGCTGCT GCAGAAGGAA GTGGGCCTGA TGCACGGCAC GCTGGACGGG GCCGATGCCC GCGCCGACCT GATCGTGGTG CCGCTCTTCG AGACCATCGG CGACCTGCGC AACGCCCAGC CCATCATGCG CGCGCTCTAC GAGGTGCCGG GCTTCGCGGC GATGGTGCAG CGCGGCGGGG CCGAGCAGGA CATCATGCTG GGCTACTCGG ACAGCAACAA GGACGGCGGC ATCTTCACCA GCAACTGGGA GCTGTACCGC GCCGAGATCG CGCTGGTGGA GCTGTTCGAC GCGCTGGAGG CGCGGCACCG CATCCGCCTG CGGCTCTTCC ACGGCCGCGG CGGCACGGTG GGGCGCGGCG GCGGCCCGAG CTACCAGGCC ATCCTGGCGC AGCCGCCCGG CACGGTGCGC GGGCAGATCC GCCTGACCGA GCAGGGCGAG GTGATCGCCT CCAAATATGC CAACCCGGAG ATCGGCCGGC GCAACCTGGA GACGCTGGTG GCGGCCACGC TGGAGGCCAC GCTGCTGCAG CCCACCAAGC CTGCCACCAA GGCCTTCCAC GAGGCCGCGG CCGAACTCTC GCAGGCGAGC ATGGCGGCCT ACCGCGCGCT GGTGTACGAG ACGCCGGGCT TCGCCGACTA TTTCTTCAAC TCCACGCCGA TCCGGGAGAT CGCCGAGCTC AACATCGGCT CGCGCCCGGC CTCGCGCAAG CCGAGCCAGA AGATCGAGGA CCTGCGCGCC ATCCCCTGGG GGTTCAGCTG GGGCCAGTGC CGGCTCACGC TGCCGGGCTG GTACGGCTTC GGATCGGCCG TGGAGGCCTT CGTGAAGGCC GCGGGCGAGG ATCCCAAGGC CCGCATGGCG CTGCTGCAGA AGATGTACCG GCAGTGGCCG TTCTTCCGCA CGCTGCTGTC CAACATGGAC ATGGTGCTGG CCAAGAGCGA TCTGGCCCTG GCGTCGCGCT ACAGCGAACT GGTGGCGGAC GCGCGCCTGC GGCGCAAGGT GTTCGGTGCC ATCGAGGCCG AATGGCAGCG CACGGTGGAT GCGCTGGCGC GCATCACCGG CGACCGGCAG CGGCTGGCCC ACAACACGGC CCTGGCGCGC TCGATCCGCC ACCGTTTCCC CTACATCGAT CCGCTGCACC ACCTGCAGGT GGAACTGGTG CGGCGCTGGC GCGCGGGCGA GGGCAGCGAG CGCGTGCAGA CCGGTATCCA CATCTGCATC AACGGCATAG CGGCGGGTCT GCGCAATACC GGCTGA
|
Protein sequence | MNATPSRKTD TAPEPQASPS PGRRADKDQP LIQDIRLLGR ILGDVIREQE GVAAYDLVEQ VRKLSVAFRR DADQEADKAL KKLLKGLSGD QTVSVIRAFT YFSHLANLAE DRHHIRRRTV HERAGSTQEG SIEVALARLR WAGIAPRAVA QTLARSYVSP VLTAHPTEVQ RKSILDAERD IAELLAERDE IQARGLFYKS ARDALTPREL AANEARLRAR VAQLWQTRLL RYSKLTVADE IENALSYYES TFLREIPRLY EDLERALGSQ PVASFLRMGQ WIGGDRDGNP NVSADTLALA LRRQAEVALR HYLTEVHYLG GELSLSARLV QVSPEMQQLA ERSPDTSEHR QDEPYRRALT GIYARLAATL RELTGGEAAR HAVAPQNPYG DASEFLADLR VIEASLRSHR GAPLAAQRLH PLVRAVEVFG FHLATVDLRQ SSDQHERVVA ELLATARIEA DYAALPEDAR QALLVRLLCD ARPLRVVGAE YSEHTRGEIA IFETARRMRD RYGAQAIRHY IISHTETVSD LLEVLLLQKE VGLMHGTLDG ADARADLIVV PLFETIGDLR NAQPIMRALY EVPGFAAMVQ RGGAEQDIML GYSDSNKDGG IFTSNWELYR AEIALVELFD ALEARHRIRL RLFHGRGGTV GRGGGPSYQA ILAQPPGTVR GQIRLTEQGE VIASKYANPE IGRRNLETLV AATLEATLLQ PTKPATKAFH EAAAELSQAS MAAYRALVYE TPGFADYFFN STPIREIAEL NIGSRPASRK PSQKIEDLRA IPWGFSWGQC RLTLPGWYGF GSAVEAFVKA AGEDPKARMA LLQKMYRQWP FFRTLLSNMD MVLAKSDLAL ASRYSELVAD ARLRRKVFGA IEAEWQRTVD ALARITGDRQ RLAHNTALAR SIRHRFPYID PLHHLQVELV RRWRAGEGSE RVQTGIHICI NGIAAGLRNT G
|
| |