Gene Acid345_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1158 
Symbol 
ID4069967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1443750 
End bp1444976 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content60% 
IMG OID637983168 
Producttryptophan synthase subunit beta 
Protein accessionYP_590235 
Protein GI94968187 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.723967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCA AACCAATCGC GAAAGTTCGT AAGTCGGTAC CAAAGAAAGT TGAGTCGCAG 
GTCGGCTACT TCGGCTCGTA TGGCGGACGC TTTGTCCCCG AGACTCTGAT GGCCGCACTC
CAGGAGCTTG AGGCTGCCTA CGAAGCAGCC AAGCGCGACA AGAAGTTCAA GGAAGAAATC
GAGTCGCTGC TGCGTGAGTA CGCGGGACGT CCGACGCCCC TGTTTTTGGC CAAGAATCTT
ACGCAGAAAC TGGGCGGGGC GAAGATTTAC CTGAAGCGCG AAGACCTTCT CCACACAGGC
GCCCACAAGA TCAACAACTG TATCGGTCAG GGGCTGCTCG CGCGCCGCAT GGGCAAGCAC
CGCATTATCG CCGAGACGGG CGCTGGACAG CATGGAGTCG CCAGCGCGAC GGTTGCCGCA
CTTTTCGGCA TGGAGTGCGT GGTTTACATG GGCAGCGAGG ACGTCCGACG GCAGGAACTG
AACGTCTTCC GGATGAAGTT GCTCGGGGCA GAAGTCGTCT CCGTCAACTC CGGTTCGCGC
ACACTAAAGG ACGCCATCAA CGAAGCCATG CGCGATTGGG TTACAAACGT CCGCACCACG
CATTACCTGC TGGGCAGTGT GCTGGGCGCA CATCCGTATC CGATGATGGT CCGCGACTTC
CATCGCGTGA TTAGCCGCGA GGCCAAGGCG CAGATCATGA AGGCCGAGGG CAAGCTGCCG
ACCGCGATCA TCGCTTGTGT GGGCGGAGGA TCCAACGCGA TTGGCGCGTT CTACGAGTTC
ATCGGCGACA AAAAGGTCCA GCTCATCGGC GTGGAGGCTG GCGGTCGCGG CAAAGCGCTG
GGCGAACATG CCGCACGCTT CCGCGGTGGA GCACCAGGCG TGCTTCAGGG CACGTATTCT
TACGTCCTTC AGGATGAACA CGGCCAGATT GCTGGCACGC ATTCGGTGTC GGCGGGATTG
GATTACCCGG CGATTGGTCC GGAGCACGCC GCTCTTGCCG AAGCGGGACG CGCCGAGTAT
GTAGCCGCCA GTGACGCAGA AGCGCTTGCA GCGTGCAGTA TGCTTGCTAA GACGGAAGGC
ATTATTCCGG CACTGGAGTC GTCGCACGCG GTGGCGGAGT GTGTTCGCCG CGCGCCGCAG
ATGCGTAAGA GTGACGTCGT CATCGTCAAT ATCTCGGGAC GAGGCGATAA AGACATCGGT
ATTCTCCGAG AGCAACTTCG GTTTTAG
 
Protein sequence
MATKPIAKVR KSVPKKVESQ VGYFGSYGGR FVPETLMAAL QELEAAYEAA KRDKKFKEEI 
ESLLREYAGR PTPLFLAKNL TQKLGGAKIY LKREDLLHTG AHKINNCIGQ GLLARRMGKH
RIIAETGAGQ HGVASATVAA LFGMECVVYM GSEDVRRQEL NVFRMKLLGA EVVSVNSGSR
TLKDAINEAM RDWVTNVRTT HYLLGSVLGA HPYPMMVRDF HRVISREAKA QIMKAEGKLP
TAIIACVGGG SNAIGAFYEF IGDKKVQLIG VEAGGRGKAL GEHAARFRGG APGVLQGTYS
YVLQDEHGQI AGTHSVSAGL DYPAIGPEHA ALAEAGRAEY VAASDAEALA ACSMLAKTEG
IIPALESSHA VAECVRRAPQ MRKSDVVIVN ISGRGDKDIG ILREQLRF