Gene Acid345_0909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0909 
Symbol 
ID4069120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1139184 
End bp1140965 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content55% 
IMG OID637982916 
ProductASPIC/UnbV 
Protein accessionYP_589986 
Protein GI94967938 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.442824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0377132 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAT TCCGGTACCC AATGAGACTT TTTCTATCGT TTTTGGCGGT TCTTCTGACC 
TCTGGTTTTG GCCAGGTTCC CGGTCCACCC GCTCCCGCAA AACCGTCGAT TCCGAAGTTT
GAGGATATCG CTAAAAAGGC GGGTGTCACG GGGTCGCACA TTTCGTCCCC GGAGCAGCAC
TACATCATCG AATCGATGAG CGGCGGCTCT GGACTCTTCG ATTGCGATGA CGACGGAAAG
CTCGATCTGC TGATCGCTGG CGGATCGACT GTAGACCGAT TTAAGCAAGG GGGGGATCCG
CTCGTCCGGC TCTATCACCA GGATGCCGAC CTAAAATTCA CCGACATCAC CGAAGCTGCA
GGTCTGACGG TCAAAGGCTG GGGAATGGGC GTCGCGGTTG CCGACTTCGA CAACGACGGG
ATCTTGGACA TCTACGTTAC CGGCTATGGG AGAAATGCGC TCTACAAGGG TCTCGGAAAC
TGCAAGTTTA AAGACGTGAC TGAAAAGGCC GGCGTCGCGA TGGATGGGTT AAACGCTGGC
GCCGCGTGGG GAGATTACGA CAAGGACGGC AATGTAGATC TGTTTGTGTC GAGGTACGTC
CACCTCGACA TAAATAACTT GCCAAGTTTC GGGAGCGACG AACGATTCTG CCGATTCAAA
GGTGTCCTGG TGCAATGCGG GCCGTGGGGC ATGCAAGGCG AGAGCGACAA GTTGTTTCAC
AACCGAGGTG ATGGAACTTT CGAAGAAGTC TCAAAAAAGG CAGGCGTGGA CGATCCGAAG
CATCGCTACG GACTCGGTGC AATTTGGACG GATTACGACA ACGATGGATG GCCGGATTTG
TATCTGGCGA ACGATGCGGG GCCGAATTTT CTTTATCACA ACAACCACAA TGGCACCTTC
GAAGAAGTCG GTTTGGTCTC GGGCGTCGCA CTCAGTGACG ACGGACAAGA GCTTGGGTCG
ATGGGAGTGG ACTCAGGCGA CTACGACCAC GATGGCAACT TCGATATCTT CGTTACGAAT
TTTACGGATC AACCCGACAA CCTCTACCAC AATCTCGGAA ACAAGAGCTT TACGGACATG
GCATGGGCGT CTGGGTTGGG GCAAGGAAGT TTCTCGTACG TGAAGTGGGG CACGGGATTC
GTGGACTTCG ACAATGATGG CTGGCCGGAT GTTTTTGTTG CGAACGGACA CGTCTACCCG
CAGGTGGACG CGATCCCCGG GAGTCCTCGC TACCGCGAAC GGATGCAGTT ATTCCAAAAC
CTGCAAAACG GAACGTTCAA AGATATATCT GAAGGTGCCG GTTTGAATGC GATTCCCGAA
CAGTCTCGGC GAGGGGCCGC ATTCGGCGAT ATCAACAACG ACGGAAATGT CGACATTCTC
CTGCAGAACA TAGGAGAACC GCCGAATCTG CTAATCAACC AGACGCAGAA TTCGAACCAT
CGCGTCATCT TCAAGCTCGT GGGAACAAAA AGTAACCGCG CCGCGATTGG TGCGCGGATC
ACGGTGATCT CTCCCACGCT CAAGCAAATG AATGAAGTTC GGAGTGGGGC GAGTTACCTC
TCGATGAACG ACCTACGTGT GCACTTCGGA CTTGGCGCTG ACGACAAGAT GACCACCGTG
GAGATCTGGT GGCCGACCGG AAAGAAAGAG GTCCTGCGGG ATGTGCCAGG GGACTTCATT
TACGAGATCG TAGAGGACCA AGGCATCCGC AAGCGCACCC AACTGCCTCC GGTCGCGGGA
GCAACCAGTA CGACGTCTGC CGCGGAGTCA CCGAAACAAT GA
 
Protein sequence
MTRFRYPMRL FLSFLAVLLT SGFGQVPGPP APAKPSIPKF EDIAKKAGVT GSHISSPEQH 
YIIESMSGGS GLFDCDDDGK LDLLIAGGST VDRFKQGGDP LVRLYHQDAD LKFTDITEAA
GLTVKGWGMG VAVADFDNDG ILDIYVTGYG RNALYKGLGN CKFKDVTEKA GVAMDGLNAG
AAWGDYDKDG NVDLFVSRYV HLDINNLPSF GSDERFCRFK GVLVQCGPWG MQGESDKLFH
NRGDGTFEEV SKKAGVDDPK HRYGLGAIWT DYDNDGWPDL YLANDAGPNF LYHNNHNGTF
EEVGLVSGVA LSDDGQELGS MGVDSGDYDH DGNFDIFVTN FTDQPDNLYH NLGNKSFTDM
AWASGLGQGS FSYVKWGTGF VDFDNDGWPD VFVANGHVYP QVDAIPGSPR YRERMQLFQN
LQNGTFKDIS EGAGLNAIPE QSRRGAAFGD INNDGNVDIL LQNIGEPPNL LINQTQNSNH
RVIFKLVGTK SNRAAIGARI TVISPTLKQM NEVRSGASYL SMNDLRVHFG LGADDKMTTV
EIWWPTGKKE VLRDVPGDFI YEIVEDQGIR KRTQLPPVAG ATSTTSAAES PKQ