Gene Acid345_1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1897 
Symbol 
ID4073358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2274454 
End bp2275539 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content61% 
IMG OID637983906 
Producthypothetical protein 
Protein accessionYP_590972 
Protein GI94968924 
COG category 
COG ID 
TIGRFAM ID[TIGR03118] conserved hypothetical protein TIGR03118 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000359391 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000915589 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCAATC CGCGCAGGCA ACTCCTCTGG GCTGCAGCAG CCCTTGCCGT GCTCACCCTC 
CCCGCCAATG CGCAGCACTA CACGCGCACC GATCTCACCA CCGACGCCGC CAGCGTCACC
ACCGCACCAA ACATTGACGC GAACCTTGTC AACGCATGGG GCCTCTCGCG CTCGTCCGGA
AGCCCCTGGT GGGTCTCCGA TAACGGCACT GGCCTTTCCA CCCTGTATGA CGGCGCCGGC
GTTCCACAAT CGCTGGTCGT CAAGATTCCC CCTCCTGGAG GCTCCACAAG TCCCGCTACA
CCCACCGGCA CCGTATACAA CTACACCACC TCGTTCGCCG TGGGTGGTAA GCCGGCGGTC
TTTCTCTTCG TTACCGAAGA CGGCACCATC TCGGGATGGA ACCCCACGGT GAACTTGACC
AACGCGATCA TCGCAGTAGA TCGCTCCAAG AGCGCCATCT ACAAAGGCTG CGCGATTGCC
CAGACCGCAT GGGGCCCACG TTTTTACGCG ACGAATTTCA AGAGCGGTCG CATCGAAATC
TTCGACGGCA GCTTCCATCG CCTTTCCACC GATCATCATG CCTTCCGCGA TGAACGCCTT
CGGGACGATT TCGTTCCCTT CAATGTCCAG AACGTCGGCG GCAATCTGGT TGTCACGTTC
GCGCACCGCG AAGAGGGAAG CCACGATGAA GATCACGGCC CCGGAGTGGG ATACGTGGAC
ATCTTCGACG TCTACGGCAA TCTCATCCAG CGCTTGCAGC ACGGCAAATT CTTGAACGCT
CCCTGGGGCA TCGCTGCGAC GCCAGCCGAT TTCGGCGCCT TCAGCCATCG CCTCCTCATC
GGCAACTTCG GCGACGGCAA GATCAATGTC TTTGATCCCA TCACTGGCAA GTTCCAGGGC
CAATTGCTCG ATGCCTCCGG TGCTCCGATC GCCATTGACG GACTCTGGGC ACTGAGCTTC
GGCAACGGCT CCAAAGCCGG CAACGCCAAC GACCTCTACT TCACCGCGGG ACCGAACGAC
GAGGGCGACG GCATCCTAGG CAAACTAAGC GCCGTAGGCA CCGAACAGCG CGGCAATACC
GAATAG
 
Protein sequence
MSNPRRQLLW AAAALAVLTL PANAQHYTRT DLTTDAASVT TAPNIDANLV NAWGLSRSSG 
SPWWVSDNGT GLSTLYDGAG VPQSLVVKIP PPGGSTSPAT PTGTVYNYTT SFAVGGKPAV
FLFVTEDGTI SGWNPTVNLT NAIIAVDRSK SAIYKGCAIA QTAWGPRFYA TNFKSGRIEI
FDGSFHRLST DHHAFRDERL RDDFVPFNVQ NVGGNLVVTF AHREEGSHDE DHGPGVGYVD
IFDVYGNLIQ RLQHGKFLNA PWGIAATPAD FGAFSHRLLI GNFGDGKINV FDPITGKFQG
QLLDASGAPI AIDGLWALSF GNGSKAGNAN DLYFTAGPND EGDGILGKLS AVGTEQRGNT
E