Gene Acid345_1255 details


Gene Information

Locus tag: Acid345_1255
Symbol:
ID: 4069830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1527935
End bp: 1529041
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 59%
IMG OID: 637983264
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_590331
Protein GI: 94968283
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
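
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1527935
END_BP = 1529041


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1107
print(protein_length(length))  # 368
```

Under these assumptions both results agree with the Gene Length and Protein Length values listed in the record.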


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.180206
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.312586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCTGGA AGGGATTTCA AAAACCGAAG CGCCTCGCGT TTGACTCGGA GTCGTTGACC 
GACAAGTATG GTCACTTCTG GGCCCAGCCG TTTGAGCGCG GCTTCGGAAC CACGATTGGC
AACGCGCTGC GCCGCGTGCT GCTTTCCTCG ATTGAGGGCG CTGCGATTAC CGCAGTGAAG
ATTGAAGGCG TATTACACGA ATTCCAGTCA ATCCCTGGCG TCGTAGAGGA TGCGACGGAC
ATCATCCTCA ACCTGAAGCA GATTCCGTTC CGCCTCAACG GAGACGCTCC CAAGGCGATC
TACCTGCGCG CGGAACAGCC TGGCATTGTG ACCTCGGGCA TGATCGAGAC CGATGCCGAT
GTCGAGATCC TCGACAAGGA CGTGTATATC GCCACCATCA GCGAAGGTGG CAAGCTCGAC
ATGGAAATGC GGTTGAAGAA GGGCCGCGGC TACGTGTCAG CCGATAAGAA CTTCGACGAA
GACCTTGGCC TCGGGTTCAT TCCGATCGAC TCGGTCCACT CGCCCGTCCG CAAGTGCAAC
TACTCGGTGG AAGCAGCCCG TTTGGGTCAG ATCACCGACT ACGACAAGCT CTCGATTGAA
TTGTGGACCA ATGGCTCCGT GAACCCGGCC GACGCGCTCG GCCTGGCCGC GAAGCTGCTC
AAGGACCACA TGAACATCTT CATCAATTTC GAAGAAGAAA TCGAAGCTTC GCACGCGGAA
GACCGCAAGC CGGAAATCCG CAACGAGAAC CTGAACCGCT CGGTGGAAGA GCTCGAGCTT
TCGGTCCGCA GCTACAACTG CCTGAAGAAT GCCAATATCC AGACCATCGG AGAACTGGTG
CAGAAGACCG AAGCAGAAAT GCTCAAGACC AAGAACTTCG GCCGCAAGTC GCTCAACGAG
ATCAAGGAAA TTCTGGCCTC GATGGGACTG AGCCTGGGCA TGAAGATCGA CGAGCATGGC
AACGCGGTGG CTCCGCCTCC GGGTTCGCAA CCTGCTCCGA GCTACGGCGG CTATCCGGGA
AGCTACGGCA CCGGCGGAAC GTTCGGTGGC GGCGGCAACT ACGGTGGTGG CGGCGGCTTC
GGCGGCGACA ACAACCCGGG CTTCTAG
 
Protein sequence
MLWKGFQKPK RLAFDSESLT DKYGHFWAQP FERGFGTTIG NALRRVLLSS IEGAAITAVK 
IEGVLHEFQS IPGVVEDATD IILNLKQIPF RLNGDAPKAI YLRAEQPGIV TSGMIETDAD
VEILDKDVYI ATISEGGKLD MEMRLKKGRG YVSADKNFDE DLGLGFIPID SVHSPVRKCN
YSVEAARLGQ ITDYDKLSIE LWTNGSVNPA DALGLAAKLL KDHMNIFINF EEEIEASHAE
DRKPEIRNEN LNRSVEELEL SVRSYNCLKN ANIQTIGELV QKTEAEMLKT KNFGRKSLNE
IKEILASMGL SLGMKIDEHG NAVAPPPGSQ PAPSYGGYPG SYGTGGTFGG GGNYGGGGGF
GGDNNPGF