Gene Acid345_0200 details


Gene Information

Locus tag: Acid345_0200
Symbol:
ID: 4069669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 213300
End bp: 214415
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 61%
IMG OID: 637982200
Product: hypothetical protein
Protein accession: YP_589279
Protein GI: 94967231
COG category:
COG ID:
TIGRFAM ID:
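
The length fields in this record follow from its coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a stop codon that is not counted in the protein length. Both function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(213300, 214415)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Under those assumptions the results agree with the Gene Length and Protein Length fields listed here.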


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.337224
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGTG TTCGCCTCCT CGTTGGCACC AAGAAAGGTG CCTTCGTCCT TACCTCTGAC 
GGCAAGCGCG AAAAGTGGGA CGTCGCCGGC CCCCATTTCT CCGGGTGGGA GATGTACCAC
CTCAAAGGAT CTCCTGCCGA CCCGAATCGC ATCTACGCAT CGCAGACCAG CGGCTGGTTC
GGCCAGCAGG TCCAGCGCTC CGACGATGGC GGCAAAACCT GGAACCCCGT CGGCAACAAG
TTCATTTATG ACGGCGTCCC CGGCACCCAC CAGTGGTACG ACGGCACGCC CCACCCGTGG
GATTTCAAGC GCGTGTGGCA CTTCGAACCG TCCCTCACCG ATCCCGACAC CGTCTATGCG
GGCGTCGAAG ATGCCGCCAT CTTCAAGACC AACGATGCCG GCCAGACGTG GACCGAACTT
CCCGGCCTTC GCAACCACTC CACCGGCAAC AAGTGGACGC CCGGCGCCGG CGGCATGGGC
CTGCACACCA TCATCCTCGA TCCCACCAAC GACAAGCGCA TCTGGATCGC CATCTCCGCT
GCCGGAGCTT TTCGCACCGA TGACGGCGGT GCCACCTGGA AGCCGATCAA CAAGGGCCTC
ACTTCGAAAT ACATCCCCGA CCCGGACGCC GAGATCGGGC ACTGCATCCA CCACATGGCC
ATGCATCCGA AGAAGCCCGG CACGCTCTTC ATGCAGAAGC ACTGGGACGT CCTCCGCAGC
GACAATTCCG GCGACCTCTG GACCCGCGTC AGCGGCAACC TGCCCACCGA TTTCGGCTTC
GTCATCGACG TCAACGCGAA CGAACCTGAG ACCATCTATG TCGTGCCCAT CAAGAGCGAC
GAACACCATT ACCCCATGGA TGGCGCCCTC AAGGTCTATC GCAGCAAGTC TGGCGGCAAC
GAATGGGAGC CGCTCACTAA GGGTCTCCCG CAGCAGAACT GCTACGTAAA CGTGCTGCGC
GACGCCATGT CCGTAGACCA ACTCGATAAG TGCGGCATTT ATTTCGGCAC CACCGGCGGC
CAGGTCTACG CGTCCAATAA TGCAGGAGAC AGTTGGCAGG CGATCGTTCA CGATCTCCCG
GCCGTCTATT CCGTAGAGGC GCAAACGCTA CCATGA
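
A GC content figure such as the 61% reported for this gene can be computed from a sequence printed in blocks of ten bases. This is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it assumes the input contains only unambiguous bases plus whitespace.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted DNA string."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence as one multi-line string works the same way, since all whitespace is discarded before counting.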
 
Protein sequence
MSSVRLLVGT KKGAFVLTSD GKREKWDVAG PHFSGWEMYH LKGSPADPNR IYASQTSGWF 
GQQVQRSDDG GKTWNPVGNK FIYDGVPGTH QWYDGTPHPW DFKRVWHFEP SLTDPDTVYA
GVEDAAIFKT NDAGQTWTEL PGLRNHSTGN KWTPGAGGMG LHTIILDPTN DKRIWIAISA
AGAFRTDDGG ATWKPINKGL TSKYIPDPDA EIGHCIHHMA MHPKKPGTLF MQKHWDVLRS
DNSGDLWTRV SGNLPTDFGF VIDVNANEPE TIYVVPIKSD EHHYPMDGAL KVYRSKSGGN
EWEPLTKGLP QQNCYVNVLR DAMSVDQLDK CGIYFGTTGG QVYASNNAGD SWQAIVHDLP
AVYSVEAQTL P
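
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The name `translate` is a hypothetical helper, and stopping at the first stop codon is a simplifying assumption.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a codon with a non-ACGT base raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the gene sequence in this record.
print(translate("ATGAGCAGTG TTCGCCTC"))  # MSSVRL
```

The output matches the first six residues of the protein sequence in this record.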