Gene Acid345_3808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3808 
Symbol 
ID4071092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4502223 
End bp4503908 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content56% 
IMG OID637985831 
Producthypothetical protein 
Protein accessionYP_592882 
Protein GI94970834 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.356806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.153495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCG ATTACAACTC CCGAATAACG ACCCATCTGA TTGTCATCGC GATCTTGCTC 
GTCTTTACAA TGTCAGCGAC GGCTCAATCT CTCATTTCCA AGCGCTCCCT CGGATCTGCA
TACCTTCCTC TCGATCATTG GGCCTACCCC GTGCTCGAAC GGGCAATCGC AAGGGGTGGA
ATTCCAGCGC AATTCATGGG CATCCGTCCG TGGACCAGGA TGGCCATCGC GGATCTGCTC
GAACAGCGCC GCCGGCAGCC TGGCAAATTC GCGAACGATG ATGAGTCGCT TCGAATGATC
GCTGCACTCG AGAAAGAGTT TCAGTACGAA CTTGGTGTGG TGGAAGGTGA GTCAGTCGGC
GACGCGCAGA TCACATCAGT CTACAGTCGT ATCACTGGGA TCTCGGGAGA CTCACTTCGC
GACGGATTCC ATCTCGGCCA AACCATCGTG AACGACTACG GACGTCCATA CGGGAACGGT
GTCAACAACG TGACGGGCGC CGGTTTCTCC GCTGACTATT CCGCGTTCAC CGCCGTATTA
AGCGGGGAGT ACCAGCATTC CTCCGAAGGC ACGCGATACT CTGCCGCCGC GCAATCTACG
CTTTCTAACG TCGACCGTAC CTCCGCTCTC ATTTCGCCCG ATCTCAATGA CGTAGATCAC
GGCACATTCA TGGATACCTA CGTTGGCGGC ACCTGGCACG GATGGGATCT CTCCACCGGC
AAAGAGAGCA TCTGGTGGGG CACCACTGAG GACAGCTCGC TCACGCTTAC AAATAACGCC
GAACCAATGT TCATGGGGAG AGTTGACCGT ACTATTCCTT TGCATTTCCC TTGGATATTC
AAATATCTCG GAGAAGTTCG CATGGACTTC TTCATGGCGA AGATGGAAGG GCACCAATAC
CCCGCTGGGC CTTGGTTTCA CGGCGAAAAA GTCAGCCTTA TGCCAACCAA GAATCTTGAA
ATCGGATTCG CCCGTACCAC CGTGTGGCTC GGCACCGGAC GCCCGTTTTC CTGGCATGCA
TTAGCGAAGA CATACTTCAG CGTGGGTGAT CAGGTAACAA ACTCCAATAC CGCACAGAAT
GATCCCGGAG ACCGTCGCGG CGAACTCGAC GTTCGCTACC GCGTGCCCGG CATTCGCAAC
TACATGACGG TATACTTCGA CTCTCTTGTG GACGACGATC CATCTCCGTT AGCGTCGCCG
CACCGCGCGG CATTTCATCC AGGGTTCTAT CTCACTCAGA TTCCCAAGGT TCCCAAGCTC
GATTTCCGGG CAGAAGGGGC TTTCACGCAG TTGAGCGGCG ACAATCGTGG TGGCCATTTC
TTCTATTGGA ACGGCGTCTA TCACGACGCC TATCTCAACA ACAGCATGCT GCTTGGCGAT
TGGGTCGGAC GCGAAGGCAT TGGTGGCCAG GCAACCGCTC GCTATTGGCT GACTCCCCAT
AACACAGTTG AGCTCTCGTA TCGGCGCAAC CAGCTCGCCA CCGACTTCAT CCCTGGTGGC
GGCTACCAGC AAGACATCTC CGCTCAAACC CGCTTTCATC TCCGCGGAGA TATCTTCCTC
TCCGGCGGCC TTCAGTACGA ACAATACCGC ATACCATTGC TCGCCAACGG GCAACAGAAT
AATTTTGCGA CGACGATCGG CTTTACTTTT GTACCGGGTG CTAAACGACG CATGCCCCAA
CCCTGA
 
Protein sequence
MRFDYNSRIT THLIVIAILL VFTMSATAQS LISKRSLGSA YLPLDHWAYP VLERAIARGG 
IPAQFMGIRP WTRMAIADLL EQRRRQPGKF ANDDESLRMI AALEKEFQYE LGVVEGESVG
DAQITSVYSR ITGISGDSLR DGFHLGQTIV NDYGRPYGNG VNNVTGAGFS ADYSAFTAVL
SGEYQHSSEG TRYSAAAQST LSNVDRTSAL ISPDLNDVDH GTFMDTYVGG TWHGWDLSTG
KESIWWGTTE DSSLTLTNNA EPMFMGRVDR TIPLHFPWIF KYLGEVRMDF FMAKMEGHQY
PAGPWFHGEK VSLMPTKNLE IGFARTTVWL GTGRPFSWHA LAKTYFSVGD QVTNSNTAQN
DPGDRRGELD VRYRVPGIRN YMTVYFDSLV DDDPSPLASP HRAAFHPGFY LTQIPKVPKL
DFRAEGAFTQ LSGDNRGGHF FYWNGVYHDA YLNNSMLLGD WVGREGIGGQ ATARYWLTPH
NTVELSYRRN QLATDFIPGG GYQQDISAQT RFHLRGDIFL SGGLQYEQYR IPLLANGQQN
NFATTIGFTF VPGAKRRMPQ P