Gene Acid345_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3800 
Symbol 
ID4071084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4490244 
End bp4491641 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content52% 
IMG OID637985823 
Producthypothetical protein 
Protein accessionYP_592874 
Protein GI94970826 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCTTA CACAGACAAG GCTGCAGAAC GTCTGGCTGA AAGCGCTAAT GGCTTGCTTG 
GTAGGTTATG CCCTACTAGG CAAGGGTTTC GCATACCTTT TCATAGGGGA ATGCGTTCTC
GTTGCTGGGT TCGCGATATT CTTACTTTCT CGCAGAGCGA CGTTAGTGGC GAGCGATAGT
GTGCTTTTAC TCTGGGGACT CTTCGCCTTT TGGGGGGCCT GCCGAACTAT CCCATTTCTA
TCGACGTACC ACTTCGATGC GATCCGAGAC GCAGTCCTTT GGGGGTACGG TCTTACCGCG
CTGTTCATTG TGGCGTCCGT CAATAGCGCG GAGCAAATTG CGAGTGGGCT TAACGCCTAC
CGCAAGTTCC TTCGCTGGTA CATGCCATTA TTGCCAATTA TCCTCCTGCT ATCCGGTCCT
TTGAGGCCGC TGATGCCCGT TGTTCCGTGG AGTCGAGACG CTGCCATCGT TATGTTGAAG
CCCGGTGATG CAAGTGTCCA TCTCGCGGCA GCTGCACTTT TCTGGTTGAT CCTTGAGAGG
CAGAGCTCCG CGCGGAAGAG AAGAGGTTTC TCGGCCATGC AAGGCGTTGC TATAGCCGGA
TGGTTTGGCG CTACGATGTT TGTACTAGTC AGAACTCGTG CTGGGGTTCT TGCCATCATC
ATTCCGATGG CCCTGGTTTC ACTTCTGAAA TTGCGAAGAG TAGCCTGGAA GGTTGGCGTA
TTTGCCGTTG CAGGAACTTT CCTGCTGGCA ATGATCTTGG AATCGAATCT GATCCAGATC
AATATACACG GCCGTAAATT CAGTTCGGAG CAGATTACAA ACAACCTGTA TAGCATAGCG
GGCCAAGGAG ATGAGAAGAC CGATCTTGAG AATACAAAGG TGTGGAGACT TATCTGGTGG
AGGCACATAG TTCAATACAC TGTTTTCGGC CCTTATTTCT GGACTGGAAA AGGCTTCGGA
ATCAATCTGG TGTTGCAAGA TGGCCCGCCC CATGTAACGG AGGACGATAA GACAACGCGT
AGCCCTCACA ATGGAAGTAT GACAGTGCTA GCGCGCATGG GAGTTCCTGG TCTCGTGATG
TGGGCCTCGC TGAATCTCGT GTTTCTCTTC CGAATGCTCC GGGCCTACCG CCGCGCTGCC
CGATCGGGAG CCCAGTTCTG GGCTTCGGTG AATCTATGGG TTCTATGTTA TTGGATTGCT
GCTTTTATCA ACTTGAGTTT CGATGTTTAC ATCGAAGGGC CAGTGGGTGG AATTTTGTTT
TGGTCGATTA TCGGCTTCGG CGTGTCCTGC CTGCGCGTAC AGAGCTATGA AGCGCGTCAG
ATTGCGCACG GCCGAGTGAG AAATTTTCAT ACCAGGTCAG CAGAGCAGTT GGCGGTCAGG
GAATTATCGC CATCCTGA
 
Protein sequence
MYLTQTRLQN VWLKALMACL VGYALLGKGF AYLFIGECVL VAGFAIFLLS RRATLVASDS 
VLLLWGLFAF WGACRTIPFL STYHFDAIRD AVLWGYGLTA LFIVASVNSA EQIASGLNAY
RKFLRWYMPL LPIILLLSGP LRPLMPVVPW SRDAAIVMLK PGDASVHLAA AALFWLILER
QSSARKRRGF SAMQGVAIAG WFGATMFVLV RTRAGVLAII IPMALVSLLK LRRVAWKVGV
FAVAGTFLLA MILESNLIQI NIHGRKFSSE QITNNLYSIA GQGDEKTDLE NTKVWRLIWW
RHIVQYTVFG PYFWTGKGFG INLVLQDGPP HVTEDDKTTR SPHNGSMTVL ARMGVPGLVM
WASLNLVFLF RMLRAYRRAA RSGAQFWASV NLWVLCYWIA AFINLSFDVY IEGPVGGILF
WSIIGFGVSC LRVQSYEARQ IAHGRVRNFH TRSAEQLAVR ELSPS