Gene Acid345_3907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3907 
Symbol 
ID4072244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4621969 
End bp4623192 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content59% 
IMG OID637985933 
Producthypothetical protein 
Protein accessionYP_592981 
Protein GI94970933 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0452365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00418649 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCCGGCA TGCCGGACAT GAATATGCAT CCGCATGGCG GCATGTCCGC AAATACATTC 
GTCGAAGCAA TCGAGAACCA CAGCAGTTCT GGAAGCAGCG TGGAGCCGAT CTCCACTCCA
GTGGACATGA TGATGTCCCT GCACAAAGGC TGGATGCTTA TGCTGCACGG CGAACTCTTC
CTGAATGCAA TTCAGCAGAG CGGACCACGT GGTGGTGACA AAGTCTTCTC TACCAACTGG
ATCATGCCGA TGGCGCAGCG AAGTCTCGGG CCCGGCACCT TGACGGTCCG TACGATGCTT
AGCCTGGAGC CGGCGACGGT CACCCAGCGC CGCTATCCTG AGCTTTTCCA AGTGGGCGAA
ACGGCGTTCG GCAATTCCAT TGTGGACGGC CAGCATCCGC ACGACTTGTT CATGGAGATC
GCCGCACTTT ACGACATCAA ACTTTCGAAA GATGCACTAC TCTCGTTCTA TGCCGCGCCG
GTCGGCGATC CAGCGATCGG GCCCATCGCC TATCCCCACC GTATTTCAGC GTCGGAGGAC
CCTCTCGCGA CGCTTGGCCA TCACCTCCAG GACTCGACCC ACGTTGCCGC CGACGTATTC
ACCGGAGGGT TCACCTACAA GATGGTGCGG CTTGAAGCTT CGGGCTTTCA CGGGCGCGAA
CCAGATGAGA ATCGCTGGAA CATCGACCAG GGAACACTCG ATTCATACTC CGCGCGCATC
ACCATCGTAC CTGCGAAGAA CTGGTCCGGC CAGTTTTCGG CGGCACATAT CGTGAGCCCA
GAAGTTATTG CGCCCAACGA AGACCAGCTC CGCATGACCG CCTCCGTGAG CTACAACCGT
CCGCTCCCGC GAGGCAATTG GGCCAGCACT GTTTTATGGG GACGCACGCG CACTCTCGGC
CAATCGCAGC CCTTCAATGG ATATCTCGCG GAGAGCACCG TGAAATTTGC GGAGAAGAAC
TACGTCTGGG GGCGCATTGA GAACGTAGAC CGCAGCACGG AATTGCTCGA ACTCCCGTCA
GCGGGAGAGG GGTTCCTCGC ACGCGTACAG GCCTACACCA CGGGCTACGA GCGGACGTTC
CACGTAGTCG ACCGTGCGGA AACAGGACTT GGCGCGCAGG TCACCTTCTA CGCGAAGCCG
GATTTTCTCA CGCCGAGCTA CGGCGATCAT CCCACGGGCG TGGTGGCTTT CTTGAAGATC
CGTTTGCGCG GCAAAGGGCA GTAG
 
Protein sequence
MPGMPDMNMH PHGGMSANTF VEAIENHSSS GSSVEPISTP VDMMMSLHKG WMLMLHGELF 
LNAIQQSGPR GGDKVFSTNW IMPMAQRSLG PGTLTVRTML SLEPATVTQR RYPELFQVGE
TAFGNSIVDG QHPHDLFMEI AALYDIKLSK DALLSFYAAP VGDPAIGPIA YPHRISASED
PLATLGHHLQ DSTHVAADVF TGGFTYKMVR LEASGFHGRE PDENRWNIDQ GTLDSYSARI
TIVPAKNWSG QFSAAHIVSP EVIAPNEDQL RMTASVSYNR PLPRGNWAST VLWGRTRTLG
QSQPFNGYLA ESTVKFAEKN YVWGRIENVD RSTELLELPS AGEGFLARVQ AYTTGYERTF
HVVDRAETGL GAQVTFYAKP DFLTPSYGDH PTGVVAFLKI RLRGKGQ