Gene Acid345_2278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2278 
Symbol 
ID4073272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2701981 
End bp2703048 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content58% 
IMG OID637984294 
Producthypothetical protein 
Protein accessionYP_591353 
Protein GI94969305 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGTC GTCTCTTGTT TTTTCTGGCG CTAGTCTCTG TCTCTACCCT GCTCGCGCAA 
TCCAAGCCAG CCACCCAGGA AGGCGACTTC GTCCTTCACG ACTTCACCTT CCGCTCCGGC
GAAAAGCTTC CCGAAGTTCG CATGCACTAC ACCACGCTTG GCAAGCCAGC GAAAGATGCG
AGCGGCCGCG TGACCAACGC CGTGCTCATC TTGCACGGGA CTGGCGGCTC CGGCGCACAA
TTTCTGCGTG CGCAATTTGC AGACGTCCTC TACGGGCCCG GGAGGTTGCT CGATGCCACC
AAGTACTTCA TCGTCCTACC CGACAACATC GGCCACGGCA AATCCAGCAA GCCCAGCGAT
GGTCTCCACG CTCGGTTTCC GCAATACGAC TACGACGATA TGGTGCTGGC GCAGCACGAA
CTGCTGGAAA AGGGCCTCGG TGTGAATCAC CTTCGCTTGA TCCTTGGCAC CTCGATGGGC
TGCATGCACT CGTGGGTCTG GGGAGAGACG TATCCCGATT TCATGGACGC GATGATGCCG
CTCGCGTGCC TGCCGGTGCC GATCGCGGGA CGCAATCGAA TCTGGCGAAA GATGATCATC
GATGGCATCA AGAACGATCC GGAGTGGAAG AACGGCGACT ACACCACGCA GCCACACGCG
GGTATCGAGA TTGGCACCGA CTTCCTCATC ATCGCCGGCA GCGCGCCGAT ACCGATGCAG
AAAGGTGAAC CAACCCGCGA TGCCGCCGAC AAATATCTTG ACGACACGTT CAAGCGGCAA
TCCGCCGGAC TTGATGCCAA TGACCTGCTC TATGCTGTCA GCGCTTCGCG CAATTACGAT
CCGTCGGCCA AACTCGATGC CATCAAAGTC CCCGTGATGT TTGTAAATTC CGCCGACGAC
TTCATCAATC CGCCGGAACT CGGCATTGCC GAGCAGGAGA TCAAGAAAGT GAAGCGCGGC
AAGTTCGTTC TCATTCCCGC CTCCGACCAA ACGCACGGAC ACGGCACACA TACGTGGGCT
GTCATCTGGC AGAAATATTT GAAGGACTTG CTGGAAGAAT CGAAGTAG
 
Protein sequence
MLRRLLFFLA LVSVSTLLAQ SKPATQEGDF VLHDFTFRSG EKLPEVRMHY TTLGKPAKDA 
SGRVTNAVLI LHGTGGSGAQ FLRAQFADVL YGPGRLLDAT KYFIVLPDNI GHGKSSKPSD
GLHARFPQYD YDDMVLAQHE LLEKGLGVNH LRLILGTSMG CMHSWVWGET YPDFMDAMMP
LACLPVPIAG RNRIWRKMII DGIKNDPEWK NGDYTTQPHA GIEIGTDFLI IAGSAPIPMQ
KGEPTRDAAD KYLDDTFKRQ SAGLDANDLL YAVSASRNYD PSAKLDAIKV PVMFVNSADD
FINPPELGIA EQEIKKVKRG KFVLIPASDQ THGHGTHTWA VIWQKYLKDL LEESK