Gene Acid345_0381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0381 
Symbol 
ID4069008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp432546 
End bp433667 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content52% 
IMG OID637982384 
Producthypothetical protein 
Protein accessionYP_589460 
Protein GI94967412 
COG category[S] Function unknown 
COG ID[COG2357] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0160658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.52247 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATG ACGAAAACAG CACCGACCGA GTTGATGTCG AAGAGGTGAT GCGGCAGTTC 
GTCGAAAAAC GAGATTTGCT GGAGGCGTTT CGTTCAAAGA CAGAGGGTTT AATCTCCGAA
TTGTTAGACG CCGCCGCCAT TCGGTGCCAA TCAATTCAGT CGCGGGTCAA AACCAACAAG
AAACTTCGGG CGAAATATCT TGACCCGAAG AAGGATTATC GCTCCCTCGA CGAGATCACA
GACCAGGTTG GCTTTAGGAT CATTGTCTAC TATCAAGACG AGATTGATGT AGTCGCCAAA
TTGGTCCGGG ACGAGTTCGA TGTGGATGAA GCGAATTCGG TCGACAAACG AATCACCGAC
CCCGAACGTT TTGGCTATCA GGCGGTGCAC TGTGTTTGTC AGCACTCGTC TGGACGTTCC
AAGATCACGG AATACAAGAA ACATGCGGGG ATCACGTGCG AGATCCAGAT TGCCACGATC
CTCGGCCATG CTTGGGCCGA AATGGAACAT GAGTGGTACG ACCTGCAGGA TGATTTTCCA
GACGATATCA AACGAAAGTT TTCGCGATTG GCTGCGCTCC TGGACCTTGC GGATTCTGAG
TTCTTGGACA TCCGTAAAAA GAAGAGCAGC TATGAGCGAT CGGTAGAACT TCGGATCGAA
GCAAACGTTC CCGATGTCCC GCTCGATTCC GTGTCTTTGA AATCGCTGTT AACTCAGGAC
CCCCACGTGA AGGAAGTCGA TAGTAAGCTG GCAGTGATTT TCGCCAGCGA ACTAGTTCCA
GATCTGTCCG ACGCCGAAGC TCGTCGGAGA TTCCCGATAA TGGAGTTCCT CGGGTTGCAG
AGCGTCCGGT CGGCGCAAGA CAAACTCAGA CAGCACGAAG CGGCACTGTT GGAATTCGCT
ACATTGTCCG AGCAGGGAGT TTGGCGCGAT TGGAAGCTCA AGACACCTAT CATGCCTGGT
ATAGGGTTTT ACCACCTGAT GTTGTTATTC GCGTTCTCCG GAGGCCTAGA GTCTGCCCAA
GTGGCTCTCG CGAAACTCGG AGGGGGGCTG AAAGGTTACC CGCACCTTGA CGAGCAAGTA
AGGATCGCGC AAGCGGTAGC GAAAAAATAC GGGCTCACCT AA
 
Protein sequence
MANDENSTDR VDVEEVMRQF VEKRDLLEAF RSKTEGLISE LLDAAAIRCQ SIQSRVKTNK 
KLRAKYLDPK KDYRSLDEIT DQVGFRIIVY YQDEIDVVAK LVRDEFDVDE ANSVDKRITD
PERFGYQAVH CVCQHSSGRS KITEYKKHAG ITCEIQIATI LGHAWAEMEH EWYDLQDDFP
DDIKRKFSRL AALLDLADSE FLDIRKKKSS YERSVELRIE ANVPDVPLDS VSLKSLLTQD
PHVKEVDSKL AVIFASELVP DLSDAEARRR FPIMEFLGLQ SVRSAQDKLR QHEAALLEFA
TLSEQGVWRD WKLKTPIMPG IGFYHLMLLF AFSGGLESAQ VALAKLGGGL KGYPHLDEQV
RIAQAVAKKY GLT