Gene Acid345_0164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0164 
Symbol 
ID4070076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp175993 
End bp177801 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content60% 
IMG OID637982164 
Producthypothetical protein 
Protein accessionYP_589243 
Protein GI94967195 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.935603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0626406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCTGC CGGTTCTTTC CCGTCGCTCG TTCTTGTCCT CCGCTTCGCT CGCTGCCGCC 
AGCCTGCCGT TCCTCCGCTC CTCCGCATTT GCGATTTCGA GCGTCCCGCT CGACGAGTTT
GGCTATGGCG ACGTCTCCCT TGAGAGCGAG TTACACAATC GTCAGTTCCA GAACACGCAC
GATGTTCTGA TGGGCTTGGA AGATGACGCG CTGCTCAAGC CGTTTCGCGC CATGGTCGGC
CAGCCCCCGC CGGGGCGCGA CCTCGGTGGC TGGTATTGCT TCGATCCGAA CTACAACCCG
AATGATGTTG GCGTGGGCTT TGCGCCGACC GCAACCTTTG GGCAGTGGAT CTCGGCGCTT
TCACGTTCTT ATGCGCTTCG TCCTGATCCG GCAGTGCGGG ACAAAGTGAT TCGGCTCAAC
CGGCTTTACG CGCAGACCAT TTCACCTGAG TTTTACGGCC TGAAGAACCG CTTCCCCGCG
TACTGCTACG ACAAGCTGGT TTGCGGATTG ATCGACGCTC ATCAATATGT CGGAGATCCC
GATGCGCTCA AAATTCTGGA GCGCACGACC GACACCGCGA CTCCCTTGCT GCCGGGCCAC
GCGGTTGAGC ACGGTACGGT TTGGCGGAGC GTGAAGGACG ACGGTTACAC CTGGGACGAG
TCGTACACGA TCTCGGAGAA CTTGTTCCTC GCCTATCGTC GCGGGGCTGG GGATCGCTAT
CGCGCGCTGG GAAAGCAGTA TCTCGACGAC ACCTACTACA ACCCGCTGGC CGAGGGCCGC
AGCGATCTTG AGGGACGGCA TGCCTATAGC CACGTGAACT CGCTTTGCTC GGCGATGCAG
GCGTATCTCA CACTCGGTGA CGAAAAGTAT TTCCGCGCGG CGAAAAACGG CTTCGACTTT
GTGCTGGCGC AAAGCTACGC TACGGGCGGA TGGGGCGCCG ATGAAACTCT GCGTGCGCCG
AACAGCCCCG AAGTCGCCAA GAGTCTTACA GGGACCCATC ATTCGTTCGA GACACCCTGC
GGTTCGTATG CGCACTTCAA ACTCACGCGC TATCTGCTGC GCGTGACCCG CGACTCGCGC
TATGGCGACA GCATGGAACG CGTGATGTAC AACACCATCC TCGGCGCGCT GCCGCTGATG
CCCGACGGCC GCACGTTCTA CTACTCCGAC TACAACTTCA AGGGCAGCAA GTTCTACCAC
GACGCGCGCT GGCCCTGCTG CTCCGGCACT ATGCCGCAGA TCGCGACCGA CTACGGCATC
AGCACGTATC TTCGCGACCC ACAGGGGATC TACGTCAACC TGTATATTCC ATCGACGGTG
CGCTGGCAGC AGGACGGGGC CCAAGTTTCC CTCACACAGA AGACCGCGTA TCCGTTCGAT
CCGGTTGTCG AGATTGAACT TTCGACCACG AAGCAGCGAG AGTTCGAGGT TCACCTGCGG
ATTCCGGCCT GGGCTGAGCA GGCATCCATC GAGGTAAACG GAAAGCGGGA AGGGGTACCC
GTAGCGGAGC GGTTCGCGAC CATCCGGCGG ACTTGGAAGA ACGGCGATCG GATTCAGTTG
GAGCTCCCGC TCAAGAATCG GCTAGAACCG CTGAACCGCG AGCGTGCGAA GCTGGTTGCA
CTCCTGAATG GCCCGCTGGT ACTGTTTCCG ATCGGCGAGA AGGCCCAGCA ACTCACTCAA
GGGCAATTAC TTGCCGCGAA GCGCGCCGGG AGCGCCTGGC GGGCGGAAAG TACCGGCGGC
CCGGTCAAAC TGTTGCCCTG GACGGAGATA CAAGATCAGC CCTATTCAAC CTACGTTCAG
CTTGCCTGA
 
Protein sequence
MALPVLSRRS FLSSASLAAA SLPFLRSSAF AISSVPLDEF GYGDVSLESE LHNRQFQNTH 
DVLMGLEDDA LLKPFRAMVG QPPPGRDLGG WYCFDPNYNP NDVGVGFAPT ATFGQWISAL
SRSYALRPDP AVRDKVIRLN RLYAQTISPE FYGLKNRFPA YCYDKLVCGL IDAHQYVGDP
DALKILERTT DTATPLLPGH AVEHGTVWRS VKDDGYTWDE SYTISENLFL AYRRGAGDRY
RALGKQYLDD TYYNPLAEGR SDLEGRHAYS HVNSLCSAMQ AYLTLGDEKY FRAAKNGFDF
VLAQSYATGG WGADETLRAP NSPEVAKSLT GTHHSFETPC GSYAHFKLTR YLLRVTRDSR
YGDSMERVMY NTILGALPLM PDGRTFYYSD YNFKGSKFYH DARWPCCSGT MPQIATDYGI
STYLRDPQGI YVNLYIPSTV RWQQDGAQVS LTQKTAYPFD PVVEIELSTT KQREFEVHLR
IPAWAEQASI EVNGKREGVP VAERFATIRR TWKNGDRIQL ELPLKNRLEP LNRERAKLVA
LLNGPLVLFP IGEKAQQLTQ GQLLAAKRAG SAWRAESTGG PVKLLPWTEI QDQPYSTYVQ
LA