Gene Apar_0662 details


Gene Information

Locus tag: Apar_0662
Symbol:
ID: 8413522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 737597
End bp: 738739
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 48%
IMG OID: 645022239
Product: chorismate synthase
Protein accession: YP_003179682
Protein GI: 257784465
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
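
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 737597
END_BP = 738739


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1143 380
```

Under these assumptions the computed values agree with the listed Gene Length (1143 bp) and Protein Length (380 aa).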


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGACCT CAACTTTTGG GACAACAGTA AAAGTATCTA TTTTTGGTGA GTCTCATGCT 
CCAAAAATCG GCTGCACAAT TGAAGGTTTA CCTGCAGGGT TTACGGTTGA CTTGCATGAG
CTTAAGACAT TTCTTCAAAG AAGATCTCCG TCGCATCCGT GGGATACTCC GCGTAAAGAA
ATTGACAATC CGAAGTTCGT CTCTGGAATT TCAGCAAAGG GTATTCTGGA TGGTTTCCCA
CTCACAGCGG AGTTGCCTAA CAACAATGTT CGGCAAAAAG ATTACACAGC TACAAAGCTT
GTTCCACGTC CAGGACATGC CGATTTTTCT GCCTGGGCAA AATGGGGAAA CTCCTACAAA
CAAACGGGTG GAGGACATTT TTCTGCTCGC CTCACAGCCC CTCTCTGCAT TGCGGGAGGC
ATAGCGCTCC AGATTTTACA CGCTCAGGGC ATCACTATTG CAGCTCACGT GCTCAAAATA
AAGAACATAA ATGATACTCC TTTTAAGCTC ATAGACAACT CCGTTGAAGC CAATAAGCTG
CTTGCTCTCC AAATGAATCA GCTGCTACAA GCCGCTCCTC AAGAACTTCC TTTCCTCGAT
GCTCAGACTG GCGAGAAAAC CCGTGCACTC CTCACACAAC TGCGCTCCGA GAAAAATACT
GTTGGTGGCA TCATAGAGTG CGTAGCAACA GGAGTTCCTG CAGGCATTGG TTGCCCTCAT
TTTCAAGGAC TTGAAAATAC AATCTCCGCC GCAGTCTTTG GTGTTCCCGC TGTTAAAGCC
ATAGAATTTG GCAGCGGTAT GAACGTTGCA AATTTACTTG GCTCTGAAAA CAACGATGCC
TACGAGGTAC GCGACGGCTC TGTTGTACCA ACCACCAATC ACGCTGGAGG CATTCTGGGC
GGCATCTCAA CGGGCGCTCC CATCTGGTTC AGATGCGCAC TTAAGCCTAT ATCAAGCATT
GGGCTTTCTC AACATTCAGT TAACCTTCAA ACTATGGAAT CAGAGCAGCT TGTGGTCCAG
GGCAGACATG ACGTAACTGC AGTCTTACGT GCTGTGCCTT GCGTTGAGAG CGCGTTTGCA
CTAGCACTTC TCGACACTTT GTATTCCTGG CCCTCTGAGC AGAACGGATA TCACAATGAC
TAA
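
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and how the database itself derives its GC value is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGATGACCT CAACTTTTGG GACAACAGTA AAAGTATCTA TTTTTGGTGA GTCTCATGCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this one line only
```

Passing the full multi-line gene sequence to `clean_sequence` in place of `first_line` gives the whole-gene figure.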
 
Protein sequence
MMTSTFGTTV KVSIFGESHA PKIGCTIEGL PAGFTVDLHE LKTFLQRRSP SHPWDTPRKE 
IDNPKFVSGI SAKGILDGFP LTAELPNNNV RQKDYTATKL VPRPGHADFS AWAKWGNSYK
QTGGGHFSAR LTAPLCIAGG IALQILHAQG ITIAAHVLKI KNINDTPFKL IDNSVEANKL
LALQMNQLLQ AAPQELPFLD AQTGEKTRAL LTQLRSEKNT VGGIIECVAT GVPAGIGCPH
FQGLENTISA AVFGVPAVKA IEFGSGMNVA NLLGSENNDA YEVRDGSVVP TTNHAGGILG
GISTGAPIWF RCALKPISSI GLSQHSVNLQ TMESEQLVVQ GRHDVTAVLR AVPCVESAFA
LALLDTLYSW PSEQNGYHND
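
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted), and it assumes the input contains only A, C, G and T. `translate` is an illustrative name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the blocked layout used above can be pasted
    directly. Ambiguous bases such as N raise a KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGATGACCT CAACTTTTGG GACAACAGTA"))  # MMTSTFGTTV
```

The output matches the first block of the protein sequence listed above.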