Gene Acid345_4123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4123 
Symbol 
ID4072314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4885042 
End bp4886097 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content58% 
IMG OID637986154 
Producthypothetical protein 
Protein accessionYP_593197 
Protein GI94971149 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.962461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGC TCCGCCATGT CGCGATGGTG ATGCTGCTCG CTGGCGTAAC TTTTGCCGGG 
GTGTGGCTGG CGCAGATCGA AGGCACTTGT CTGGACGAAG CCGGCAACCC CCTCGCCAAC
GCCGAACTGA AATTTCTCGA CAAGCACAAC GGCCATCGCT TTTCCGTGAA GACCGACGCG
AAGGGGAAGT TCTTCTTCGG CGGCGTGGAT CCGGGTGCGT ACTCGGTGAC AGTATTGCGC
GGCAACCAGG TGGCGATGGA ATTTCCGGCG ATCGCGATTA GCTGGAGTTC GCGGCCGCAG
CAGTTGGCGC TGGACCTGGC AAAACATTCC ATCGAGGTGA AGCGCGAAAC ACGCCAGGCG
GAGACGCTCG GTGGAGACAC TTCTCCAGAC GACTTTACGC CGGTGGTAGT GGGAGATGAC
GCGCAGACCG TAGCGGTACG AACGGCAATC GAGCAGGCGC AAAAGCAAGG ACAGAATGGA
GACTGGGCCG GAGCGATTGC GACGCTGAAG GCGAACGCTG AGTCATCGGG CGCGAAGTAC
GACATGGTGT GGGCGCAGTT GGCGAGTGCG TATTGCCACG CTAGCAAGTT TGAAGATTGC
GCTGCGGCGT ACGGGAAGGC GCTTGCGCTC AAAGAAGTGG GTGCGTATTA CAACAATCGC
GCGCAGGCGC TGGTCGTACT GAAACGATGG AATGAAGTTG ATCACGACAT GATGCTGGCG
GAGAAGATGA ACCCGGAGCA TCGCGTGCTC TATGAGCGGA ACCACGGCAT GATGCTGGTG
CAGAAAATCC AGAACGGCGA GAGCGACAAT ACGGCTACGG ATTTCGAGGG CGCAGTTCGT
GCCTTAAGCT CTGTGCTGCA AGAAGAGCCG GCGAATGCTG AGCTTTATTA CTTACGTGCA
TATTGCCAGA TCCGATTGCT CGGTGTGGCG AAGGAACCGC CTGCGTTTTC GGCAATTGAG
AGTGGACTGC GCAAGTATCT TGAGTTGGAG CCGCACGGGA AGCATGCCGA AGAAGTGAAT
GCGATGCTGA AGAGTGTGGA AGAAGAGAAG CGGTGA
 
Protein sequence
MKMLRHVAMV MLLAGVTFAG VWLAQIEGTC LDEAGNPLAN AELKFLDKHN GHRFSVKTDA 
KGKFFFGGVD PGAYSVTVLR GNQVAMEFPA IAISWSSRPQ QLALDLAKHS IEVKRETRQA
ETLGGDTSPD DFTPVVVGDD AQTVAVRTAI EQAQKQGQNG DWAGAIATLK ANAESSGAKY
DMVWAQLASA YCHASKFEDC AAAYGKALAL KEVGAYYNNR AQALVVLKRW NEVDHDMMLA
EKMNPEHRVL YERNHGMMLV QKIQNGESDN TATDFEGAVR ALSSVLQEEP ANAELYYLRA
YCQIRLLGVA KEPPAFSAIE SGLRKYLELE PHGKHAEEVN AMLKSVEEEK R