Gene Acid345_3614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3614 
Symbol 
ID4070134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4275534 
End bp4276841 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content62% 
IMG OID637985637 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_592689 
Protein GI94970641 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0604352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.343798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAAGTG ATCAGCACAG CGCAGTGAAG TCGGTCCGTG TTCAACCCGC AAGCAATATT 
CTCGGCAGCG TTTCCTTCCC CGGCGACAAA TCCATCTCGC ACCGCTACGG CATGCTCGCC
GCGCTCGCCG AAGGTACGTC GCGCTTCAAG AATTTCTCTA CCGGCGCCGA TTGCGCCAGC
ACGCTTTCCT GCATGGAACA GCTTGGTGCG AAGGTCACGC ATAAAGACGA CGGCGTGATC
GAGGTTGAAG GCGTCGCCGG ACAGTTGCGA AAGTCTGCAA CTCAACTGGA CTGCGGAAAC
TCCGGTTCCA GCATGCGCAT GTTGTCTGGA ATTCTGGCGG CGCAGCCCTT CGATAGCGAA
CTCATGGGCG ATGCCTCGCT CTCTCGCCGT CCCATGCGCC GCATCGTAGA CCCACTGAAG
CAGATGGGCG GGGACATCGA AACCACCGAC GGCCACGCCC CGCTGCGCGT TCGCGGCAGC
AAACTTACGG CGATTGACTA CATCACGCCG GTTCCCAGCG CGCAGGTGAA GTCGTGCGTG
CTCTTCGCCG GCGCGTTCGC CGCCGGCATT ACTTCGGTGG ACGAAGCGAT CCGCACCCGC
GACCACGGCG AAATCGCGCT AAAAGCCTTC GGCGCCGAAG TAGAACGCCG CCAAAACCGC
GTCAGCGTGC GCGGCGGGGC CAAGTTTCGC GCCATCGAAG CGGTTGTCCC CGGCGACATC
TCTTCCGCCG CGTTCTTCCT CTGCGCGGCC GCGCTCTTCC CAACCTCGAA CCTCGTCTTC
GACGGCATCC TGCTCAATCC ATCCCGCGCC GCGATCCTCG ATGTCCTCGC TTCCATGGGC
GCCAAGCCGA AGTTCCTCCA GGTGCAGGAG CAGCACGGCG AACTCGTGGG CACCATCACG
CTGGCGCCGG CGGGCCTCTC TGGATTGAAA ATTTCCGGTG GCCTCACGGC TTCGCTGATT
GACGAACTGC CTGTGCTTGC CGCCATCGGT GCGTACACGC GTTATGGCAT CGAGATCCGC
GATGCAAAAG AGCTGCGCGT GAAAGAAAGC GATCGTATTG CAGTCGTCTG CGCGAACCTG
CGCGCGATGG GCGCCGAGGT CGAAGAGTTC GACGACGGCC TGCGCGTGAA GGGAAGTCAG
AAACTGCACG GCGCCGAGAT CGAATCACAC GAAGATCACC GCATAGCGAT GGCTTTTGCG
GTCGCCGCGC TGCGTGCCGA GGGCGAGACG GTCATCAACG GTGCCGACTG CGTTGCCATC
TCGTACCCGG AGTTTTTCGA CACGCTGAAC AAGGTTGTGG AACGCTAA
 
Protein sequence
MASDQHSAVK SVRVQPASNI LGSVSFPGDK SISHRYGMLA ALAEGTSRFK NFSTGADCAS 
TLSCMEQLGA KVTHKDDGVI EVEGVAGQLR KSATQLDCGN SGSSMRMLSG ILAAQPFDSE
LMGDASLSRR PMRRIVDPLK QMGGDIETTD GHAPLRVRGS KLTAIDYITP VPSAQVKSCV
LFAGAFAAGI TSVDEAIRTR DHGEIALKAF GAEVERRQNR VSVRGGAKFR AIEAVVPGDI
SSAAFFLCAA ALFPTSNLVF DGILLNPSRA AILDVLASMG AKPKFLQVQE QHGELVGTIT
LAPAGLSGLK ISGGLTASLI DELPVLAAIG AYTRYGIEIR DAKELRVKES DRIAVVCANL
RAMGAEVEEF DDGLRVKGSQ KLHGAEIESH EDHRIAMAFA VAALRAEGET VINGADCVAI
SYPEFFDTLN KVVER