Gene Acid345_4150 details


Gene Information

Locus tag: Acid345_4150
Symbol:
ID: 4072341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4909720
End bp: 4910865
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 62%
IMG OID: 637986181
Product: hypothetical protein
Protein accession: YP_593224
Protein GI: 94971176
COG category:
COG ID:
TIGRFAM ID:
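
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the 1146 bp gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4909720
END_BP = 4910865
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumes the stop is included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```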


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.280981
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAA CTCCCAGCAC TCCTGGCGAC CCGAAGAATG CGCATTTTCC CTGCCCCGGC 
TGTTCGGCGG ACATGAAGTT CGACCCCGCG TCGGGGATGA TGAAGTGTCC GTTCTGCGGA
ACCACCTCGG CGGTGCCGGC ATTGAAGCAA ACGACATCGG CGGTAGCGTC GCCATCACCG
GGGCACCTGG ATTGCCATCC GCTGGAAGAG TTCCTGGCGA AGGCGCACGA CGGGCAGCTG
ACGCAGCTCG CGCCACAGGC GCTGGAAGTG CATTGCGCGG CCTGCGGCAG TTCGGTGACG
TTCCAGCCGC CGGAGGTGGC AGGGGTTTGT CCGTTCTGCG GATCGGCAAT TGTGGCACAG
GCGAAGGCCG CGGACCCGCT GCTGGCACCG GATGGCGTGC TTCCGGCAAA GATCGTGAAG
CAGCAGGCGC AGGGCGAGGT GAAGCAGTGG CTGAGCTCGC GCTGGTTCGC GCCCAACGCG
CTGAAGACGA TGGCGCGGCA GGAAGGCATC AACGGCGTGT ACCTGCCGTT CTGGAGCTAC
GACGCGGACA CCGCCAGCAA TTACACCGGA GAGCGAGGCA TCAACCGCAC CGAGACGGAG
AGCTATACCG ACAGTTCAGG CAACCGGCAG ACCCGGTCGC GCACTGTGAC GGACTGGTGG
CCGTGCTCCG GTCATGTGAA CGTGAATTTC CATGACGTGC TGATCGCCGC GTCGCGCTCG
GTGCAGGAAA AGAAGCTGGA TGCGCTCGAA CCGTGGGGGC TGGAAGCGTT ACAGGCCTTC
GAACCCGCGT ATTTAGCGGG CTTCAAGGCG CAACGCTACC AGGTGCAATT GGCAGACGGC
TACACCGAGG CGAAACAGGT GATGGCCAAC GGGATCGAGC AGGCGATCCG GCGGGACATT
GGCGGCGATG AGCAGCGGAT TTCGTCGGTG GACTCGACGT ACTCGAATGT CGGGTTCCGA
CATTTGTTGC TGCCGGTGTG GATTGGGGCT TATCGCTTCC AGAACAAGGT GTACCAGGTG
GTGGTGAATG CGGCGACGGG CGAGGTGCAG GGAGATCGGC CGTACAGCGC GGTGAAGATT
GCGATGTTGG TGATCTTTAT CATTTTCGTG ATTTTGATTT TAGCGATGAT TGGGGGAAAG
CACTGA
 
Protein sequence
MATTPSTPGD PKNAHFPCPG CSADMKFDPA SGMMKCPFCG TTSAVPALKQ TTSAVASPSP 
GHLDCHPLEE FLAKAHDGQL TQLAPQALEV HCAACGSSVT FQPPEVAGVC PFCGSAIVAQ
AKAADPLLAP DGVLPAKIVK QQAQGEVKQW LSSRWFAPNA LKTMARQEGI NGVYLPFWSY
DADTASNYTG ERGINRTETE SYTDSSGNRQ TRSRTVTDWW PCSGHVNVNF HDVLIAASRS
VQEKKLDALE PWGLEALQAF EPAYLAGFKA QRYQVQLADG YTEAKQVMAN GIEQAIRRDI
GGDEQRISSV DSTYSNVGFR HLLLPVWIGA YRFQNKVYQV VVNAATGEVQ GDRPYSAVKI
AMLVIFIIFV ILILAMIGGK H
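
The gene and protein sequences above are printed in space-separated blocks of ten characters, and the record gives translation table 11. As a minimal sketch of how the two sequences relate, the following strips the block spacing, translates codons until the first stop, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon-to-amino-acid assignments of table 11 are the same as the standard code; the sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the usual NCBI layout: bases ordered T, C, A, G,
# first base varying slowest. Amino-acid assignments for table 11
# match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGGCGACAA CTCCCAGCAC TCCTGGCGAC"))  # MATTPSTPGD
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) is one way to confirm that the two blocks of this record agree.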