Gene Acid345_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2079 
Symbol 
ID4069930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2490823 
End bp2492952 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content61% 
IMG OID637984094 
Producthypothetical protein 
Protein accessionYP_591154 
Protein GI94969106 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000832848 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACTCCG TGCACTCGAA CCATCGCCGC AGTCACGTCT CACGCCGCAG AACCCACGTG 
CTTTGGGCTC TTCTCTCGCT CGTCGCACTC TGTGGAGTTT TGCCCGCCCA GAGCAACTCC
GACGCCTTCG ACGCCAACGC CAAACGGCCG ATCCATATCG TCCCAATCAC CAACCCTAAT
CAACCGCGGA CGAACGTCAT CAAGACGGCC GGACTCCTGC ACTATTACGG CGGGCCGGTC
ATTTCCAATC CCGACGTCAT CATCGTCAAC TGGGGAGCGG TGAGCAAGCC CGTCGGAACC
GGCGGCGTCA CCATGGCCAA TTTCCTCCAG GACGTCACCA ATACCACCTA CTGGGACATG
CTCTCCGAAT ACAACACGGC TGGGATGGTC GTGGGTGCAC ACACCGGAAA CCAGACCATC
GGGCGCGGCA CATTCAATAC AGAAATCACC ATCACGCCGA CGGTATGCAC GGCAACGACT
TGTACTGACG CCCAGATCCA AAGCGAACTC AATTCCCAGA TCACCGCCGG GCACCTCCCC
GAGCCTACGG TGGACGCCAA CGGGCACTCC AACACCATCT ACATGATCTA TTTCCCGTCA
GGCTTCGTCA TCAAACTCGA CGCATCGGAC ACCAGTTGCG TACAGTTCTG CGCCTATCAC
AGCACCGCGA ACCGGAGTGG TGTCGACCTT CTGTACGGAA TTTTTCCCGA TCTCACCAAC
CCAGGCTGCG ACGGCGGCTG CGGTGCTGGC GCGACCGATT TCGACAATCT GACGAGTGTC
TCCTCGCACG AACTGGTCGA AGCTACTACC GATGCGGAAG TCGGATATGC TTCCACCGTC
GCTTATCCCA TTGCCTGGTA CAACACATCT CAGGGTGAAA TCTCCGATAT CTGCAATGCC
CAGCAGACCC CTCTAGATAC TGCCCGCGCA ACCTACACCG TCCAGAAGCA GTATTCCAAT
CTCGCCGGCG TCTGTACCGC GAGCACGGCA AACTCTGCCA CTTCATTTGC CCTCTCTGCT
CCTTCGACGG TCGCAGCCGG TGTCCCTTTC AATGTCACCG TCACAGCTAA GAACTCAAGC
AGCGCGACCA CGGCCGCTTA CGGTGGCACC GTCCACTACA CGAGCTCCGA CGGCAGCTCG
ACCCTGCCGG CCGACGGCAA ACTGACCAGC GGCGTCAGCA CCATACAAGT CACGCTCCAC
ACCCTCGGGG CACAGAGTAT TACTGCCGCT GACGTCAACC AGCCCGGAAT TACCGGCCTC
GCGAATGTGA CGGTCACAGC CTCCTCTTCC GCGACCATGA CCAGTCCTGC GAATGGGAGC
ACCCTCTCCG GATCCTCCAC CACGTTCAAT TGGAGTCCGG GTTCTGGCGC AACCCAGTTC
TCCCTCTACG TCGGCACCGC ACCGGGCGCC CACGACGTCT TCTTCTCCAC CTTCCCGACT
TCTACGACGT CCACCACCGT GAATAGCATT CCCACCCACG GCGCGTACCT GTACGTCACG
CTCTACTCGT ATTACAGCGG AGCCTGGCAC CCCAACAGCT ATCGCTACAT CGAATCCGGC
ACGCCAGTCC CCGCAACGAT GGCCACTCCA TCGAACGGCG CTGTTCTTGC GGGAACCAGC
CAGTCGTTCA CGTGGAACGC CGGCGTTGGT GCGAATAACT TCTCGCTCTA CGTGGGAACC
GCGCCCGGTG CACACGACAT CGCGTACCAG ACCTTCGGCT CTGCCACGAC GTCCACAACC
GTAAGCGGCC TGCCCACCAA CGCACAGCCG GTTTACGTCA CGTTGTACTC CTACATCGCG
GGCACCTGGC GCGGCAACTC GTACAAGTAC TACGCGAGCG GCACAACCGC TCCAGCCACG
ATCGCCACAC CCACTCCCGG CAGCACGCTC TCCAGCGCCA GCCAATCGTT TACCTGGACC
GCCGGCACGG CTGTCTTGCA GTACTCGCTG TACGTGGGCA CCACACCCGG CGCGCACGAT
GTCTTCTACG GAACCTTCAA CCCCGGCACC ACGTCCACCA CGGTGAGCAG CATTCCAACC
ACTGGCGCGA CGCTCTACGT TCGCCTCAAC TCGTTCACGC AGGGCACGTG GCAATCCGAG
TCCTATACCT ACACCGAAGC CGGCCCATAA
 
Protein sequence
MHSVHSNHRR SHVSRRRTHV LWALLSLVAL CGVLPAQSNS DAFDANAKRP IHIVPITNPN 
QPRTNVIKTA GLLHYYGGPV ISNPDVIIVN WGAVSKPVGT GGVTMANFLQ DVTNTTYWDM
LSEYNTAGMV VGAHTGNQTI GRGTFNTEIT ITPTVCTATT CTDAQIQSEL NSQITAGHLP
EPTVDANGHS NTIYMIYFPS GFVIKLDASD TSCVQFCAYH STANRSGVDL LYGIFPDLTN
PGCDGGCGAG ATDFDNLTSV SSHELVEATT DAEVGYASTV AYPIAWYNTS QGEISDICNA
QQTPLDTARA TYTVQKQYSN LAGVCTASTA NSATSFALSA PSTVAAGVPF NVTVTAKNSS
SATTAAYGGT VHYTSSDGSS TLPADGKLTS GVSTIQVTLH TLGAQSITAA DVNQPGITGL
ANVTVTASSS ATMTSPANGS TLSGSSTTFN WSPGSGATQF SLYVGTAPGA HDVFFSTFPT
STTSTTVNSI PTHGAYLYVT LYSYYSGAWH PNSYRYIESG TPVPATMATP SNGAVLAGTS
QSFTWNAGVG ANNFSLYVGT APGAHDIAYQ TFGSATTSTT VSGLPTNAQP VYVTLYSYIA
GTWRGNSYKY YASGTTAPAT IATPTPGSTL SSASQSFTWT AGTAVLQYSL YVGTTPGAHD
VFYGTFNPGT TSTTVSSIPT TGATLYVRLN SFTQGTWQSE SYTYTEAGP