Gene Acid345_0317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0317 
Symbol 
ID4068594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp342299 
End bp343450 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content58% 
IMG OID637982320 
Producthypothetical protein 
Protein accessionYP_589396 
Protein GI94967348 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.102396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.535957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTTCAC ATCGAAAGCA GCAACTCATC GGTCTTCTCC TCCTCCTCGT CTCTTTCACT 
GCGGTCGCAC AAACCCCGCG CTGGACGGAA GAAAAAGCCG CGCAGTGGTA CAAGCAGCAG
CCCTGGCTCG TCGGCAGCAA CTTCATTCCC ACCGACGCGA TCAACGAACT CGAGATGTGG
CAGGCCGACA CTTTCAACCC GCAGGAGATC GACCGCGAGC TCGGATGGGC CGAAGGCCTC
GGCATGAACA CCATGCGGGT CTTCCTGCAC GATCTTCTCT GGCAGCAGGA TGCCGCCGGC
TTCACCAAGC GCCTCGATCA ATTTCTCGGC ATCTGCGCCA AACACCACAT CCGCCCCATG
CTTGTGATCT TCGACTCCGT ATGGGACCCA AATCCCAAGC TCGGTCCGCA GCATCCGCCC
GTTCCCGGCG TGCACAACTC TGGATGGATG CAGAGCCCCG GCCGCAAAGG ACTCGAAGAT
CCCGCGGAGT ATCCGCGGCT GAAGGCATAC GTGCAGGGCG TGGTCGGCAA ATTCGCCAAC
GATCAACGCA TCCTCGCCTG GGATGTTTGG AATGAGCCCG ACAACGACAA CAAGCCTGCC
TACGAGCGCG TCGAGCTGCC GTACAAAGCC GACTACGTCA ATAAACTCCT GCCTCAAGTT
TTCGAATGGG CGCGTGAGAT GCATCCCATC CAACCACTCA CCAGTGGCGT GTGGCGCGGC
GACTATTCAT CTCTGGATAA GGCCATTCCA ACCGCGAAAA TCCAGCTGGA ACAGTCCGAC
ATCATCACCT TCCACAGCTA CGACTGGCCG GAAACTTTCG AAGAGCGTAT CAACTGGCTC
CGCGCCTACA ACCGGCCCAT CATCTGCACC GAATATATGG CTCGTCCCGC AGGCAGCACC
TTCGACACAG TCTTACCCGT CGCACTCAAA GAGCACGTCG GCGCCATCAA CTGGGGATTG
GTTGTCGGAA AAACGCAGAC CAATTTGCCG TGGGATTCTT GGAAGCGTCC TTACGTTCTG
GAGCCGCCGG TCGCGTGGTT CCACGAAGTC TTCTACGCCG ATGGTCGTCC CTACCGCGCT
CGCGAAGCCG AAATCATCCG CAATTTAACA TCGCAGGCCA ACGGGAACTC TGGAGATTCA
GGAACAAACT GA
 
Protein sequence
MRSHRKQQLI GLLLLLVSFT AVAQTPRWTE EKAAQWYKQQ PWLVGSNFIP TDAINELEMW 
QADTFNPQEI DRELGWAEGL GMNTMRVFLH DLLWQQDAAG FTKRLDQFLG ICAKHHIRPM
LVIFDSVWDP NPKLGPQHPP VPGVHNSGWM QSPGRKGLED PAEYPRLKAY VQGVVGKFAN
DQRILAWDVW NEPDNDNKPA YERVELPYKA DYVNKLLPQV FEWAREMHPI QPLTSGVWRG
DYSSLDKAIP TAKIQLEQSD IITFHSYDWP ETFEERINWL RAYNRPIICT EYMARPAGST
FDTVLPVALK EHVGAINWGL VVGKTQTNLP WDSWKRPYVL EPPVAWFHEV FYADGRPYRA
REAEIIRNLT SQANGNSGDS GTN