Gene Apar_0663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0663 
Symbol 
ID8413523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp738740 
End bp740020 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content44% 
IMG OID645022240 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_003179683 
Protein GI257784466 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTT CAATAACTCC TCATGATTTG TTTGGAACGG TAGAGGCCAT TTCTTCCAAA 
TCATACGCTC ATCGCGCGCT CATCTGCGCT TCGTTAGCAA ACGGCACCAC TAACATTACA
TGTCCCTACA TCTCAGAGGA CATTCAAGTA ACTGTTGATT GTCTTAAAGC TTTGGGCACA
ACTATTGCTA GAACTAAACA GGGATTTAGA GTTGTCCCTG CAAAAGAGGC AAACCGTCCT
CAAAATATTC AGCTTAACTG CAAAGAATCT GGTACCACGT TCAGGTTTAT ACTCCCCCTT
CTTGGAGCGC TTAATATAAG TGCCACAATT TCTGCAGAAG GAAGACTCTT CTCAAGACCT
CTTGAACCTC TTATCTCAGA GCTCTCTTCC CATGGTATGT CTTTTACTTG GTTAGACGAA
AAATTCCTTG AGGTTTCTGG CCAGCTTGAT GGTGGTTCGT TTGAGATGCC GGGAAATATT
TCTTCCCAGT TTATTTCTGG ACTCTTGCTG TCATTACCCT TATTGAATAA ACCTTCAACT
ATCCAAATTA CAGGACCCAT TCAATCAAAA GACTATCTAG CTATTACTGA GCATGTTATG
ACAGACTTTG ACATTACTAC CCCGTTTGAA GTAAACAGTG CCACCTATAC CATTGAGGCT
CAACACTATG TCTGTCCAGG CGTTTATTCA ATTGAAGGAG ATTGGTCAAA TGCTGCTCCT
TGGCTTGTTG CGGGAGCCAT TGGTCAAGGT GTAGAGGTCA CAGGACTCTC CATGCAAAGT
ACCCAAGGAG ACAAAGCTAT TTTAGCCGCG CTCTCTCTTG TGGGCGCACG TGTTTCTCGC
CAGCAAAAAG GTGCTGCAAG CATGATGGAT CACTTAAGGC CATTTTCCAT CTCTATCTCT
GAAGTATGCG ACCTCGCTCC GGTACTTGCT GCGCTAGCAG CGTTTATCCC TGGAACAAGC
AAGCTCACTG ATATTCAGCG CCTGCGTCTT AAAGAGTCCA ATCGTGTTCA AAGTATCTGC
ACCACACTTA AAGCATTTGG TGTGACTGTT GAGCTCTCTG AAGATCAAAC GGAGCTTGTT
ATCACTGGAG GAAAACAGCC CACGAGCTGT ATCGTAGACT CATATAATGA TCATCGCATT
GTCATGATGT CCGCTATTTT GGCTTCGTAT GCAACTGGTC CTGTTGTCAT CACCCATGCC
GAGGCAATCA ATAAATCGTA TCCACTTTTC TTTGAGCACT TTAAGCAGCT TGGTGGCATC
TGTCACAGTA TGGAGTCATA A
 
Protein sequence
MDISITPHDL FGTVEAISSK SYAHRALICA SLANGTTNIT CPYISEDIQV TVDCLKALGT 
TIARTKQGFR VVPAKEANRP QNIQLNCKES GTTFRFILPL LGALNISATI SAEGRLFSRP
LEPLISELSS HGMSFTWLDE KFLEVSGQLD GGSFEMPGNI SSQFISGLLL SLPLLNKPST
IQITGPIQSK DYLAITEHVM TDFDITTPFE VNSATYTIEA QHYVCPGVYS IEGDWSNAAP
WLVAGAIGQG VEVTGLSMQS TQGDKAILAA LSLVGARVSR QQKGAASMMD HLRPFSISIS
EVCDLAPVLA ALAAFIPGTS KLTDIQRLRL KESNRVQSIC TTLKAFGVTV ELSEDQTELV
ITGGKQPTSC IVDSYNDHRI VMMSAILASY ATGPVVITHA EAINKSYPLF FEHFKQLGGI
CHSMES