Gene Acid345_4167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4167 
Symbol 
ID4072126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4933429 
End bp4934727 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content59% 
IMG OID637986198 
ProductPyrrolo-quinoline quinone 
Protein accessionYP_593241 
Protein GI94971193 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.275915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAAT CCGCAGTAGC AGTGTTACTT TTCTCGACGA TCGCGGTGGC CCAGGATGCG 
GCGATGTTCC GTGGAAATCC GGAGCACACA GGGGTGTATA ACGCGGCGGG GGCGCCGCAG
TTTCATTCGC TGAAGTGGAA GTTCGAGACG AAGGCGCAGC TTCACAGCAG CCCGGCGGTC
GCGAATGGCG TGGTGTATGT GGGTAGCACG AGCCGGAACC TGCTGGCCGT CGATCTCGAG
ACTGGCAAGC TGAAGTGGAA GTTCGAGACC GGGGCGAGGA TTGTTTCGTC GCCGGCTGTG
GTGGATGGCG TGGTGTATGT CGCGTCGTAC GACGGCAACT TCTACGCGGT GGATGCGGTG
ACCGGCAAGG AGAAGTGGAA GTTCAAGACC GGCGGGGAAA AGCGATTTGC GGCGCCACAT
CTGCATGGAT CTACGCCGGT AACGGAGACG ATGCCGGACC CGTTTGATTC TTATCTCTCT
TCGCCGGTGG TGGTTGCGGG AGTGGTGTAT TTCGGGAGCG GCGATACGAA CGTGTATGCG
CTGAACGCAG CGGACGGGTC GCTGAAGTGG AAGTTCAAGA CTGGCGATGT GGTGCATGCG
TCACCGGCGC TGGCGGATGG GACATTGTAC GTCGGGAGTT GGGACAGCTA CTTCTACGCG
ATTGATGCCG CGACAGGCAA GGAGAAGTGG AAGTTAAAAA CCGGCGAGGA CCACGACATC
TACAACCAGG TGGGCATTCA GTCTTCGGCT GCGGTGGCCG ATGGAGTCGT TTACTTCGGG
TGCCGCGATT CCAACTTCTA CGCGGTGGAT GCGAAGACAG GTGAGAAGAA GTGGGCGTTC
AACAACAAGG GGTCGTGGGT GATCTCGTCG CCGGCGGTGA AGGATGGGCG CGTGTACTTT
GCGACGTCGG ACACGGGGCT GGTTTATGCG CTCGATCTGA ACGGAAAGGA AGTTTGGCAT
TTCGATGGGA AGCATTGGGT GGTGTTCTCG TCGCCGGCAA TTGCGGGGAA TACGCTGTAT
CTGGGATCGC ATGCGGGGAA GCTGCGGGCG ATTGATCTGA CGTCGGGGAA GTTGGCGTGG
GAATTTGAGA CGGATGGATC GAAGGCCAAC GGCGCCGGAC TCACGAAGCC CGATGGAACA
CCGAATTACG AAGCGGCTTT CCATACGAAT TTCTACGACG ACATGGTGGC GGGGGTGCAG
ATCATGTTGA AGACGGGAGC GATTTTGGGA TCGCCGGTGG TGAGTGGGGA TACGGTGATC
TTTGCCAGCA GCGACGGGAA TGTGTATGCG GTGCGGTAA
 
Protein sequence
MLKSAVAVLL FSTIAVAQDA AMFRGNPEHT GVYNAAGAPQ FHSLKWKFET KAQLHSSPAV 
ANGVVYVGST SRNLLAVDLE TGKLKWKFET GARIVSSPAV VDGVVYVASY DGNFYAVDAV
TGKEKWKFKT GGEKRFAAPH LHGSTPVTET MPDPFDSYLS SPVVVAGVVY FGSGDTNVYA
LNAADGSLKW KFKTGDVVHA SPALADGTLY VGSWDSYFYA IDAATGKEKW KLKTGEDHDI
YNQVGIQSSA AVADGVVYFG CRDSNFYAVD AKTGEKKWAF NNKGSWVISS PAVKDGRVYF
ATSDTGLVYA LDLNGKEVWH FDGKHWVVFS SPAIAGNTLY LGSHAGKLRA IDLTSGKLAW
EFETDGSKAN GAGLTKPDGT PNYEAAFHTN FYDDMVAGVQ IMLKTGAILG SPVVSGDTVI
FASSDGNVYA VR