Gene Acid345_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0100 
Symbol 
ID4069475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp104112 
End bp105413 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content60% 
IMG OID637982100 
Producthypothetical protein 
Protein accessionYP_589179 
Protein GI94967131 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.807355 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.588322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTATTC GCTCTCTCGC GCTACGCGCT CTTACACTCT GGTGCTTGAT CGTGCTGCTG 
TGCGCGCCCG GCGCCTACGC GTACTCGGTT CTTACTCACC AGGCAATCAT TGATCTGGCA
TGGGACGATT CGATTCGCCC ATTTCTCTTG AGCCGGTATC CGAACGCGAC AGCAGAACAA
CTTCAGGTTG CGCATGCCTA TGCCTACGGT GGGTGCGCGA TCCAGGACAT GGGGTACTAC
CCATTCGGGC ACACCTTCTT CAGTGATCTC ACTCACTATG TGCGTGCCGG AGATTTTGTC
GCCAGCCTGT TCCGGAACGC GCAGAACTTG AATGATTTGG CTTTCGCCGC GGGCGCGCTT
TCTCACTACC TCGGCGATTC TTTCGGCCAC TCCATCGCCA CGAACCAGGC CACACCCATC
GAGTTCCCAG ACCTGGGCGC GCGGTATGGG ACTGTGGTGA CGTACGAGCA GGACCCGCAT
GCGCACGTTC GCACTGAGTT CGGCTTCGAT ATCGAGCAAG TCTCGAAGCA GCGGTTCGCG
CCGCACTCGT ACCTCGTACA CATCGGGCTG CTGATACCAC GTCCTTTGCT GGAGAAGGCG
TTCTTCGAGA CTTACGGCAT GCCGCTCCAC ACCCTGCTCG GCGAAGAGGG GCCGTCGATG
CGGAGCTACC GGTCGGCGGT ACGCAGTTTC ATACCATTCT TCGCGCGCGG CGAGGTGGTG
CTGCATCGGC ACGAGTTTCT GCAGGAGCAG CCGAGTCCGG AGTTCTCCAC GTATTCCGAG
GAATCGGAGC ATGCCGACTT TCGAAACCAC TCGCCGCAGG GGTACCGGAA CCCTGGATTT
GTCGGGCATC TGTCGGCGGC GATCGTCTGG ATCGTGCCTA AGAGGGGTCC GGCTGCAATG
CTGGCAATCA AAATCCCCAG CCACGAGTCG CAGGAGCTTT ATGCGAAGAG CATGACTACG
ACACTGGAGC ATCTGCACAA GCACCTCGGG GACCTCGGGC ACGGAGAGGT CACGACCTTT
GCGCTGGCAG ACCGCGATCT TGACACCGGA GCGAGAACAA AGCCAGGTGG CTACGCGCGC
ACGGACGCGA CCTACGCCAA GCTCCTGCAC GACGTGGTGA CCCGGCCGCA AATGACGATC
CCACTGGGAT TGAAAGAAGA TGTTCTCGCA TACTATGCGG ACCTGAATGC GCCGATTACG
ACCAAACAGA ACCCGAAGCA ATGGGCGCAG GTGCAGCAGG AACTGGAGCA TTTCCGGACC
ATGAAAAGCA CCAGCCAGGT TCTGATTCCG AGCGAGCCGT AA
 
Protein sequence
MPIRSLALRA LTLWCLIVLL CAPGAYAYSV LTHQAIIDLA WDDSIRPFLL SRYPNATAEQ 
LQVAHAYAYG GCAIQDMGYY PFGHTFFSDL THYVRAGDFV ASLFRNAQNL NDLAFAAGAL
SHYLGDSFGH SIATNQATPI EFPDLGARYG TVVTYEQDPH AHVRTEFGFD IEQVSKQRFA
PHSYLVHIGL LIPRPLLEKA FFETYGMPLH TLLGEEGPSM RSYRSAVRSF IPFFARGEVV
LHRHEFLQEQ PSPEFSTYSE ESEHADFRNH SPQGYRNPGF VGHLSAAIVW IVPKRGPAAM
LAIKIPSHES QELYAKSMTT TLEHLHKHLG DLGHGEVTTF ALADRDLDTG ARTKPGGYAR
TDATYAKLLH DVVTRPQMTI PLGLKEDVLA YYADLNAPIT TKQNPKQWAQ VQQELEHFRT
MKSTSQVLIP SEP