Gene Acid345_1711 details

Gene Information

Locus tag: Acid345_1711
Symbol:
ID: 4072056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2077840
End bp: 2079084
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 60%
IMG OID: 637983719
Product: hypothetical protein
Protein accession: YP_590786
Protein GI: 94968738
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.99847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.186192
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGACC GCGAGCAATT TCCGAAGCAG ATCGACCGGC TGGTCGCGAG CCGGACACTG 
CACGGATCAG AGTCGCTGTG TAAATTGCTC CGGTACCTGG CGGCGCATGC GATTGAGCAT
CCGGGCACGC CGCTGAAGGA ATATCAGATC GCGACCGAGG TCTTCGGACG AAGGCCGGAC
TTCGACCCAC AATCCGATTC GACGATCCGC GTGCAGGCCG GAAGGCTGCG GGCGAAGGTC
GCGGAGTACT ACGCGTCGGA AGGGGAAAGC GATCCGGTCG TGGTGGAGCT GCCGAAGGGA
AGCTACCTGC TGAACTTCCG GTATCGGACG CCGGTTCCTA AGGAAGAGCT GGTGAATCAT
GCACCGGTGC ATGAACCAGT GACGGTGGCG CCACAGAAAT CATACGGCGG GATGGCGGCA
ACTTTGGCCG TGCTTCTGGG ACTCGCGGTG GTGACGATTG GTTATTTGCT GCTGAATGGA
AGGGTGAAGA CGGATACTGC GTCGGCGGCG ACGGCAGGAC CGCTGGCGCC GGCAGAGTTC
CAGGTGTTCT GGAGGAAATT TCTGTCGGGT CCAGAGGAGC CCTGGGTGGT GTTCAGCAAT
GCGGAGTTTG TGGGGCGTCC GGAGACGGGG ATGCGCTACT ACGATAAGCA GCGCGATGCG
AACACGCCTC CGTACGACCA CTACACCGGC GTTGGAGAAG TGCTTTCCAT TCACGAACTG
GACCAGGTGT TCAATTCGTT GCACCGGCGG ATTCGGGTGA AGCGCGGCAG TTTGTTTTCG
CTGGATGACG CGAAGAACAA CGATTTGATA TTTATTGGTT CGCCGTCAGA AAACCTGACA
CTGATGGAGA TTCCGAGCAC GGATGACTTC CGGTTCGATC GAGTGAAGAC AGGACCGAGG
GCCGGCGACC TTGCGGTGAT CAATGTGCAT CCGCAAGCGG GAGAGCAACC GTTCTACCTA
GCGAGTCGGG CGGGTGATCC GCTCGTTGAA GACTACGCGG TGGTGGGAAT GATGCCGGCG
TTGAATCCGC AGCGGACGGA AGTGATCCTC GCCGGAACGA CGACGTTCGG TACGCAGGCG
GCGGTGGAAT ATGTTTGCCG GCAGAGTTCG GTGAAGCAAT TACTCGATCG GCTTGGAACG
TCGGGGGGCG AGGTGAAGCC ATTCGAGGCG ATCCTGCATA TTAAGGTGGC GAAGGGCGTG
CCGGTCGAGA CGGAGTTGGT CGCGGTACGG CTGAGGAACC AGTAG
 
Protein sequence
MVDREQFPKQ IDRLVASRTL HGSESLCKLL RYLAAHAIEH PGTPLKEYQI ATEVFGRRPD 
FDPQSDSTIR VQAGRLRAKV AEYYASEGES DPVVVELPKG SYLLNFRYRT PVPKEELVNH
APVHEPVTVA PQKSYGGMAA TLAVLLGLAV VTIGYLLLNG RVKTDTASAA TAGPLAPAEF
QVFWRKFLSG PEEPWVVFSN AEFVGRPETG MRYYDKQRDA NTPPYDHYTG VGEVLSIHEL
DQVFNSLHRR IRVKRGSLFS LDDAKNNDLI FIGSPSENLT LMEIPSTDDF RFDRVKTGPR
AGDLAVINVH PQAGEQPFYL ASRAGDPLVE DYAVVGMMPA LNPQRTEVIL AGTTTFGTQA
AVEYVCRQSS VKQLLDRLGT SGGEVKPFEA ILHIKVAKGV PVETELVAVR LRNQ