Gene Acid345_0780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0780 
Symbol 
ID4069525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp963353 
End bp965593 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content62% 
IMG OID637982786 
Producthypothetical protein 
Protein accessionYP_589859 
Protein GI94967811 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0567586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.143797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTGC GCCTCGATTT CGTAGATGAT TTCCTCGTGT CATCCATGCC ATCCAATGTG 
GCTTCGCAGT CCGCAAGTCC GCCGCGCGGC GAATCTGTGC GCTATTGGAT CGCCGTTGGC
CTCGCGCTTT CCTACGCATT CTTCGCCGGA CTGAAGACCG CGGCCGATCC CGATCTCGGC
TGGCAACTTG CCGCCGGTCG CTGGATGCTT GAGCACCACC AGATCCTGCG CACCGACGTC
TTCACCTACA CCGGCTTCGG ACGCGAATGG ATTTACCCGG CCCTTTCGCA AATCTTCGAA
TACCTCCTGT ATCGCATCGG CAGCTACTCG CTGCTCTCGT GGACAGCCGC TGTTGGTTGC
GTCGCGACCG TAGCGCTTCT CCTGCATGGA GCACGCTTTG CCACCGCAAT CATCGCGATT
CTCGCCATAC CTCTACTTGC AGCGCGTTGC GTCATGCGCG CTGAACTCTT CTCGGTCATT
CTCTTCGCTG CGTTCGTCTC CATCCTTTGG AACTTTCATC GCTCCCGTCG CGGACTGCTG
TGGATCCTGC CGCTCTTGAT GGCGCTTTGG GTGAACCTGC ACCTCGGCTT TTTGGCGGGC
TTCGGGATGT GCGCAGCCTA CGTGTTGCTC GAGATTGGCG AGCTCTTCAC GCTGCAGAAA
CGCAGTGATG CCCTTTCTCG TCTCCGCTCC GCCGCGCCAT GGCTGCTCGC AACCATACCC
GCGACCCTGC TCAATCCCTG GGGATGGCGC GTCTACGCTG GCATGTTCCG CCTCATGCCG
ACGGGCACCA ACCCCTTCAT CCTCGAACTC ATGCGCGTCC GCGTGAACTC GACCACAGCA
ATGCAAGCCT TCGCGTGGCG CGACTACGAG AGCGCGTTCT TCTGGTTCCT CGCCATCGCC
GCCATCTGCA CCGTTGCAGC TCTCATGCAA CGTCGTTTCG CCGAAGCCGT CATCCTCGTA
GGATCCGTGT ACGCTGCGGT TCACGCATCG CGGTTCGTAG CGATGTTCGC CATCATCGTC
GTCGTAATCG GCGGGTCCGT TTTCACCGAT TGGGTGCCCC ATGTCTCGCG ATTTTCGAGA
CGTGGGGATT TCCCCGACAT CTCCACGAAA GCAGCCACCA TCGTCCTCGT CCTCTCGGCG
TTTCTAGTCG CGGTCCGAAT CTCCGATCTC GTCACCAACC GCTTCTACCT GCGCACGCCC
GGCCAATACT CCGTCTTCGG CGCAGGCGCT CCCATACGCT TCCCCGCTGG CGCCGCGGAT
TTCATCGTTC GTAACCACCT CCCCGCGAAC GTCTTCAACG ACTACAACTC TGGCGGCTTC
CTCATGGGCA AGCTCGCTCC CGAATACCGT CTCTACCTCG ACGGCCGCGG CGAACTCGAA
CCCGGCCTCT ACGTCCACGC GCAGCAACTG CTGACACAGT CGCTCGACTC GCAAGACTGG
CAGCGCGAAG TCGCATCGCG CCACATCAAC ACGGTCGTCG TCTCCCTCGA CCGCGAATAC
GGCATGGGCC TCGCGAGCCT CAACAAGTTC TGCAACAGCC CCGGATGGAA GCCCGCTTAT
CTCGATCCAT TTGGCGCTGT CTTCGTAAAT ATGGGTGCCC CACCGTCGGG GTCCCCGGAG
AGCGCCGCCT TTGCGCTTTC TGGGGTGGCA GGCTTCCCGA AGGGTGAGTG GGAAGAGACG
CAACTCGACT GCTCCCAAGT TCGCTTCGAC GCTCCTCCAA CTGGCGACAG CTTCCGCGCT
CGCGCCGATC GCTTCAACTA CCTCCTCAAC AGCGCCGCAA TCCTCATCGT TCTCGATCGC
ACCGCCGAAG CGCTCTCTGC CTTGCAAAGC GCCGAGGCCG TCGAGTCGCA AAATGCATTC
CTCCACTACG CCAAAGGCGC CGCGCTTCTC CAATCCGGTC GCTGGAACGA GTCGGAGTCT
TCGCTCCACA CTGCCGTGAA TCTCGGTTCC GACGAAGCCG CTTCCGCGCT AGCCCGCGCC
TACGACCAGC AAGGCCGCTA CCCCGACGAA GTCGCAGTCC TGCATCTCGC CGCCTCCCGC
GCACCGCAAC CAAGTTGGTT CTACCTCAAG CTCGGCCTCG CCGAACTCGC GCAAAACCAC
GCCCGCGAAG CCCTCGACGG ATTCCACAAT GCCGAGCGCG AAGACCCGTT TAACGGCGGG
GACGACGCCG GCACCGGCTA CCATTCCCAA CTCGCCGAGG GCCATGCCCG CGCCGAAGCG
CTGCTACAAT CGCAGCGCTA A
 
Protein sequence
MGLRLDFVDD FLVSSMPSNV ASQSASPPRG ESVRYWIAVG LALSYAFFAG LKTAADPDLG 
WQLAAGRWML EHHQILRTDV FTYTGFGREW IYPALSQIFE YLLYRIGSYS LLSWTAAVGC
VATVALLLHG ARFATAIIAI LAIPLLAARC VMRAELFSVI LFAAFVSILW NFHRSRRGLL
WILPLLMALW VNLHLGFLAG FGMCAAYVLL EIGELFTLQK RSDALSRLRS AAPWLLATIP
ATLLNPWGWR VYAGMFRLMP TGTNPFILEL MRVRVNSTTA MQAFAWRDYE SAFFWFLAIA
AICTVAALMQ RRFAEAVILV GSVYAAVHAS RFVAMFAIIV VVIGGSVFTD WVPHVSRFSR
RGDFPDISTK AATIVLVLSA FLVAVRISDL VTNRFYLRTP GQYSVFGAGA PIRFPAGAAD
FIVRNHLPAN VFNDYNSGGF LMGKLAPEYR LYLDGRGELE PGLYVHAQQL LTQSLDSQDW
QREVASRHIN TVVVSLDREY GMGLASLNKF CNSPGWKPAY LDPFGAVFVN MGAPPSGSPE
SAAFALSGVA GFPKGEWEET QLDCSQVRFD APPTGDSFRA RADRFNYLLN SAAILIVLDR
TAEALSALQS AEAVESQNAF LHYAKGAALL QSGRWNESES SLHTAVNLGS DEAASALARA
YDQQGRYPDE VAVLHLAASR APQPSWFYLK LGLAELAQNH AREALDGFHN AEREDPFNGG
DDAGTGYHSQ LAEGHARAEA LLQSQR