Gene Acid345_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1645 
Symbol 
ID4072532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1992152 
End bp1993381 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content59% 
IMG OID637983654 
Productflagellar basal body FlaE 
Protein accessionYP_590721 
Protein GI94968673 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATGT TTTCGATTCC GTTGTCGGGC CTCACCGCGA GTTCCACCGC ATTAGCCACG 
ATCGCCAACA ATCTTGCCAA TCAGAACACG ATTGGCTACA AGCAGACACG CGCACTGTTC
CGCGACCTCT TTTATCAGCA GATCGGACAG ACCGGCAGTG GCGATCCTAT TCAGGTCGGC
GCCGGAACCA TGATCGGCAC CATCGACACC AACTTCACCG ATGGCAGCGT GAGTCCGACC
GGCGTGCCAA CCGATGTGGC GATCATGGGT GACGGTTTCT TCGTGGCCCA GCAAAACGGG
AACGATATTT ACACTCGCGC CGGTAATTTC AAGGTTGGCG CCGATGGGAC CCTCAGGACG
CAGGATGGCG CGGTGGTGCT TGGCTACCAG GCCGTGGACG GCAAGGTCAC AACAGGATCC
GGGCTGGGTG CGCTGAATCT CGGCCAAGGC CAAGTTAGTT CGCCCTCAGC GACGACTTCC
CTCCAGTTGA CGACGAACCT CAATGCCAGC GCGAAAGTTG GCGACAGCTA CAACACCTCG
TTAAAGGTCT ATGACTCGCT GGGGGGCGTC CACGTAGTGA CTTTTACGTT CACGAAGACT
GGAACCAACA CGTGGGACTA CGACGCTTCT CTGCCCACCG GAGAAGGCAC GGTTTCGCTG
CCGTCCGGCA GCCACACGCT GACCTTCGAC AGCGACGGCA AACTTACGAC TCCGTCTTCG
AACATCAATT TCGACCTGAC GGGCCTGAGT GATGGCGCAA GCGACATGAA AGGGGTGACG
TGGAAGCTCT ATGACGCTAC CGGCGGTTCG TCGATGACCC AGATGGCAGC GGACAGCGCG
ACCCCGGCGA CGGCACAGGA CGGCTACGGC AGCGGCATGT TGCAGAACTT CAACATCGGC
GCGGATGGAA CCATCGAAGG AACCTTCAGC AACGGTAAGA CTTCGATCAT CGGGCAGATC
GCGATTGCGA GCTTTCCAAA TGTGCAGGGA CTCAGGAAGG TAGGCCAGAA CGCATACGTT
GGGACTCTCG CGTCGGGCCA GGCAGCGCTC GGCGCGCCGG GAAGCGGCGG ACGTGGCACC
CTCGGGGGGG GAGCGCTGGA GCTTTCGAAT GTTGATATGG CGACGGAGTT CTCCAATCTC
ATCGTGGCGC AGCGAGGTTA CCAAGCCAAT GCCAAGGTGA TCACAACCTT CGATGAGATC
ACCCAGGACA CCATTAACCT CAAGCGCTAA
 
Protein sequence
MPMFSIPLSG LTASSTALAT IANNLANQNT IGYKQTRALF RDLFYQQIGQ TGSGDPIQVG 
AGTMIGTIDT NFTDGSVSPT GVPTDVAIMG DGFFVAQQNG NDIYTRAGNF KVGADGTLRT
QDGAVVLGYQ AVDGKVTTGS GLGALNLGQG QVSSPSATTS LQLTTNLNAS AKVGDSYNTS
LKVYDSLGGV HVVTFTFTKT GTNTWDYDAS LPTGEGTVSL PSGSHTLTFD SDGKLTTPSS
NINFDLTGLS DGASDMKGVT WKLYDATGGS SMTQMAADSA TPATAQDGYG SGMLQNFNIG
ADGTIEGTFS NGKTSIIGQI AIASFPNVQG LRKVGQNAYV GTLASGQAAL GAPGSGGRGT
LGGGALELSN VDMATEFSNL IVAQRGYQAN AKVITTFDEI TQDTINLKR