Gene Acid345_0557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0557 
Symbol 
ID4073046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp685027 
End bp686007 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content61% 
IMG OID637982562 
Productaminodeoxychorismate lyase 
Protein accessionYP_589636 
Protein GI94967588 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.245938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0550645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAGT TCTTCAGCTT CGTGCTTCTC CTCGTCCTCG CCGTCGCGGG ATGGCTTGCC 
TGGGCGCTCT ACCTGCCCGT CGCGCCGTCA GAGCCGAAAT TCGTGCTGCT GCGTCCCGGC
TGGACAACGC GCCATATCGC CCGCGAGCTG AAGGACAACG GCATCATTCG CTCCGACAAA
GCGTTCCTCT TCATGCATAT CCTGCGCGGC GAGCGCAGCC TGAAGGCCGG CGAATATAAA
TTCGATAGCC CGGCGAATGC GCTGAACGTC CGTGATCGCC TCACCCGCGG CGACATCTAC
GTTCGCCAGG TCACGGTCCC CGAGGGCTAC AACATGTTCG ACATCGCACA GGCGGTCGAA
CAGGCGGGCC TCGGCACTGC CGCCGAATTC CTCAACGCGG CACGCCAGGA TTTGTTCCTG
CTCAAAGATG TCGATCCGAC AGCGAAATCC CTCGAAGGCT ATCTTTTCCC GGACACCTAT
TCGTTCACGC GCACCATGTC ATCGCATGAC ATGGCCACCG CCATGGTGCA TCGCTTCAAG
CAGGAGGCGA AGGCACTCAA TCTCGACAGC GATGTCCATC GCGTGGTGAC GATGGCCTCG
ATCGTGGAAA AAGAAACCGC AGTTCCCGAC GAGCGCCCCC AGGTCGCCAG CGTCTATTAC
AACCGGCTCG ACAAGAACAT GACGCTCGCT GCCGACCCGT CGGTGATCTA CGCCGCGCTC
CTCAATAACC GCTACCGCGG CACCATCTAC CAGTCCGACC TGCAGTACGA CTCGCCCTAC
AACACCTATA AGTACGCCGG GCTTCCACCG GGGCCAATCG CCAATCCAGG CCGCGCGGCG
CTCGCCGCAG CCATGCATCC GGCGCAAACG CAGTACCTCT ATTTCGTCGC CGACGCGCAG
GGCCATCACC GTTTCGCAGC AACCCTCGAC GAGCACAATC GCAACGTGCT GGCCTACCGG
CGGGCAATAG CGGCGAAATA A
 
Protein sequence
MRKFFSFVLL LVLAVAGWLA WALYLPVAPS EPKFVLLRPG WTTRHIAREL KDNGIIRSDK 
AFLFMHILRG ERSLKAGEYK FDSPANALNV RDRLTRGDIY VRQVTVPEGY NMFDIAQAVE
QAGLGTAAEF LNAARQDLFL LKDVDPTAKS LEGYLFPDTY SFTRTMSSHD MATAMVHRFK
QEAKALNLDS DVHRVVTMAS IVEKETAVPD ERPQVASVYY NRLDKNMTLA ADPSVIYAAL
LNNRYRGTIY QSDLQYDSPY NTYKYAGLPP GPIANPGRAA LAAAMHPAQT QYLYFVADAQ
GHHRFAATLD EHNRNVLAYR RAIAAK