Gene Acid345_4366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4366 
Symbol 
ID4071784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5174062 
End bp5175552 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content60% 
IMG OID637986399 
Producthypothetical protein 
Protein accessionYP_593440 
Protein GI94971392 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.9507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.531855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCG ATGTGGTGGA GACTCCGACG AAGGCGGAGG CGGTGGATGA TCCGGATCTT 
CTCGATCTAG TGGAGGAGGG CCGGAAACTG GACCAGAGGT ATTCGAAGCG GTGGCTGAAT
CGTGACGTGT GGACGACGAT GCTGTTGCAG ATCCGCGAGA AGCATGGGCA CCTGATTCCA
CTTCGGTTGA ACCGGGCACA GCAACACTAT GCGAAGACGT GCTCGCGGCG AAACATTGTT
TTGAAGGCGC GACAGCTGGG AATTACGACG TATGTCGCGT CGCGATTTTT CTTGAGTACG
ATTATGCGGC CGGGGACGGT TACGGTCCAG GTGGCGCACG ACCAGACGGC GGCCGAGGAG
ATCTTCCGCA TCGTGCATCG CTTCGTGGAG AACCTGCCCG AAGAGATGCG GAAGGGCGCT
TTAACGACGT CGCGGCTGAA CACGCGGCAG ATTGTGTTTC CGAAACTGGA TAGCGCGTAC
CTGGTGGAGA GCGCGGCGGA CGTGAATGCG GGGCGTGGGC TGACGATCCA TAACCTGCAT
TGTTCGGAGG TGGCGCGCTG GCCGGGAGAT GCGGCGGAGG TGCTGGCGTC ACTGCGGGCG
GCGGTGCCGA AGCATGGCGA GATTGTGCTG GAGAGCACGC CGAATGGCGC GGGTGGATGT
TTTTACGATG AGTGGCAACA TGCGGAAGAG AAGGGATACA CGCAGCACTT CTTTCCGTGG
TGGTGGGAGA AGAGCTACAC GATTGGGCAT CGCGCGGAGG AGCTGAGTCC GGAGGAGGAG
TCGCTCGTGG GGCGATATGG ATTATCGCGA GAGCAGATTG CGTTTCGTCG CGAGCTGCAA
TTTAACTTCG GCAAATTGGC GCGGCAGGAG TATGCGGAGA CGCCGGAAGA GTGTTTCCTG
GCGAGCGGCG AATGCGTGTT CGAAGTGGAC GTCATCGAGA AACGTTTGGC CGAATTGCGC
GGGCCGGTGG AGACGCGCGA GAACGGGCGG ATCGAGACTT ACTATCCGCC GGTGCGTGGA
CGAGAGTATG TGATTGGCGT GGATCCGGCG GGCGGTGGAT CGGAAGGCGA CTATGCCGCA
GCGCAAGTGA TTGAACGCTC GACGGGGTTG CAGTGCGCGG AATTGCGCGG GCATTACACG
CCGGTGGAAC TGGCTTCGCG AGTTTCACAG TTGGGTCGCG AGTATAACGA CGCGTTGGTG
GCGGTGGAGC GGAACAATCA TGGTTGCGCC GTGCTGGTGT GTTTGGAACA GAGTTATCGT
CATCTTTATG AAGAGCGCGG GCAGACGGGG TGGTTGACTA CTTCGGCTTC GCGGCCTCGG
ATGATTGAGC AGTTGGCTAG CGTTCTACGG CAGGAACCGG AGAAATTCGA ATCGCGGCGG
TTGCTGGAGG AGTGTAAGGC GTTTGTGCGG AAGAGCGACG GAGCGTGCGC GGCTAGTAGC
GGAGCGCATG ATGATTTGGT TTTGGCCATG AGCATTGCGG TGAGTGTTTA G
 
Protein sequence
MSSDVVETPT KAEAVDDPDL LDLVEEGRKL DQRYSKRWLN RDVWTTMLLQ IREKHGHLIP 
LRLNRAQQHY AKTCSRRNIV LKARQLGITT YVASRFFLST IMRPGTVTVQ VAHDQTAAEE
IFRIVHRFVE NLPEEMRKGA LTTSRLNTRQ IVFPKLDSAY LVESAADVNA GRGLTIHNLH
CSEVARWPGD AAEVLASLRA AVPKHGEIVL ESTPNGAGGC FYDEWQHAEE KGYTQHFFPW
WWEKSYTIGH RAEELSPEEE SLVGRYGLSR EQIAFRRELQ FNFGKLARQE YAETPEECFL
ASGECVFEVD VIEKRLAELR GPVETRENGR IETYYPPVRG REYVIGVDPA GGGSEGDYAA
AQVIERSTGL QCAELRGHYT PVELASRVSQ LGREYNDALV AVERNNHGCA VLVCLEQSYR
HLYEERGQTG WLTTSASRPR MIEQLASVLR QEPEKFESRR LLEECKAFVR KSDGACAASS
GAHDDLVLAM SIAVSV