Gene Acid345_2930 details

Gene Information

Locus tag: Acid345_2930
Symbol:
ID: 4070854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3474814
End bp: 3476190
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 57%
IMG OID: 637984949
Product: flagellar hook-associated protein
Protein accession: YP_592005
Protein GI: 94969957
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
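
The start, end, gene length, and protein length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated on this page), and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3474814, 3476190)
print(length)                     # 1377
print(protein_length_aa(length))  # 458
```

Both values agree with the Gene Length and Protein Length fields of this record.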


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.685458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCAC TCTCAGGAAC ACTTTCGATT GCAACCCGGG CGATGCTGGC GGAACAAGGA 
GCCATGCAGG CGACGACGAA CAACATCGCC AATGTCAACA CTCCGGGGTA CTCGCGCCAG
ATTCCGATTT TTTCGGCCGT CGATCCAGTT GTGTCCGGCT CAACGACCTA CGGCAATGGA
GTCGAACTCA GGGGGTACCA AAGCTTACGC AACCGGGTGC TAAATCTTAG GATCGCGGAA
GAGCAGCAGA ACCAAAGCGC GCTGCAGAGT TACGTCTCCT CGATGGACCA AGTGCAGGTG
GCGTTCAGTG ACGCGAGCGG AGGGATTGGC GGTGCTCTCA GCGCATTCTT TAACAGCATC
TCGCAATTGT CGACGCAGCC GGATAGCACT ACCCTGCGAC AGAGTGTCCT GTCCTCTGCA
AATACTCTAG TCAACGCGTT TCATACTGAC GCAAGTTCGT TGAGTTCCAT GCAACAAGGG
CTCGACCTGG AGGTCAAGGA CCAGGTAAAC GAGGTCAATC GGCTCACCGT CCAGATCGCA
TCGCTTAACG GTCGCATTTC CGCGATGCAG AAGCTGCATC AGGACCCGGG AACACTGAGC
GATCAGCTCG ACCAATCGGT TTCCGAACTC TCCAATCTGC TGGATGTCTC CGTAACCCAA
ACCGAAGACG GTATCAGCCT GACGACCGCG AACGGCGTAG CGCTGGTGGT GGGCGACAAG
AGCTTCGGCC TTACGACGGC GGCAGATCCG GACACTGGCC TTTCGCGAGT GCAGGCCGGC
GAGATCGATC TTACGGCGCT GTTGCAGAAA GGCAGCCTCG GAGGTGTCCT TCGAGTTCGC
GACGAAGAAG TTCCTAGCAT CCAATCGCAG CTGGATAAAC TTGCTGCCGG GCTCTCGACG
GCAATGAACG CGGTTCACCA AACCGGCTTC GACCTCAATG GGAATCCGGG AGGCCTGCTT
TTCTCTGCGC CGCCCAGTGA TGGTAAGAAT GCAGCTGCTC TGATCAGCCT TGCCATCAGT
GACCCCGCAC AACTCGCGAT GAGCGGCAAT GGGGCTGCGG GAGACAACAC CGTCGCCAAT
GAGCTATTGC AGATCAAAAA CCAAGCTGTT GTAGGCAATG CGACGCCGAT CGATGCCTAT
TCGCAGATCG TGTTCAATGT CGGCAGCGAC ATTTCCGATG CCCAAGCAGG GCTCGATACC
AGCACTTCCC TGCTCCAGCA ACTGCAGGAC CAGCGCGGGG CTTTGAGCGG TGTTTCGCTT
GATGAAGAGA CTGCCAATCT GCTCCAGTTC CAGCGTGGAT TTGAAGCCGC GGCCCACCTT
GTGAGCGTAG TCGACGAGAT GATGCAGACA GTGATCGCGA TGGGAGTTAC ACAATGA
 
Protein sequence
MASLSGTLSI ATRAMLAEQG AMQATTNNIA NVNTPGYSRQ IPIFSAVDPV VSGSTTYGNG 
VELRGYQSLR NRVLNLRIAE EQQNQSALQS YVSSMDQVQV AFSDASGGIG GALSAFFNSI
SQLSTQPDST TLRQSVLSSA NTLVNAFHTD ASSLSSMQQG LDLEVKDQVN EVNRLTVQIA
SLNGRISAMQ KLHQDPGTLS DQLDQSVSEL SNLLDVSVTQ TEDGISLTTA NGVALVVGDK
SFGLTTAADP DTGLSRVQAG EIDLTALLQK GSLGGVLRVR DEEVPSIQSQ LDKLAAGLST
AMNAVHQTGF DLNGNPGGLL FSAPPSDGKN AAALISLAIS DPAQLAMSGN GAAGDNTVAN
ELLQIKNQAV VGNATPIDAY SQIVFNVGSD ISDAQAGLDT STSLLQQLQD QRGALSGVSL
DEETANLLQF QRGFEAAAHL VSVVDEMMQT VIAMGVTQ
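
The protein sequence can be related back to the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to produce this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with their one-letter amino acids ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used on this page
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGTCAC TCTCAGGAAC ACTTTCGATT"))  # MASLSGTLSI
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to the same function allows the remaining residues to be compared in the same way.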