Gene Acid345_0535 details

Gene Information

Locus tag: Acid345_0535
Symbol:
ID: 4069993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 659452
End bp: 660660
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 59%
IMG OID: 637982540
Product: hypothetical protein
Protein accession: YP_589614
Protein GI: 94967566
COG category:
COG ID:
TIGRFAM ID:
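
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skips spaces and newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


print(gene_length(659452, 660660))  # 1209
print(protein_length(1209))         # 402
```

Both results agree with the Gene Length and Protein Length fields. Passing the gene sequence from the Sequence section to `gc_percent` would give a value to compare with the GC content field.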


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.492189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCGAA AGTCATTAAC CGTTATCGCC GTCTGTCTTC TTCCTCTCTC TGCATCCCTT 
GCGCAGACCA CGCCACCCTC CTGTCACTCA GCAGCCGCCG GACAAAACGG TAGCCTCGCC
GGCTACGTGC CCTTTGGTCC GAGCAGTCCT TGGAACCAGG ACGTCTCTAA GGCGACGGTG
GATCCCAACT CATCGGCGAT TATCGGTTAC GTTGGGACGA CGAAGCCTCT CCATCCGGAC
TTCGGCGCTG GCCTCTATCA AGGCTCAACC ATGGGCATCC CATACATCGT CGTGACCTCC
GCGACGCCGA ATGCCACCAT CCATTTCACG GATTCACCCG GCGAAAGCGA CCCGTCGCCG
ATGCCGGTTC CGAAAACTGC ACCGATTGAG GGTTATCCCG CGCCAGGCAG CGGCGATCGC
CACGTGCTCG TACTCAACAC CACGACTTGC TGGCTCTACG AGCTTTACTC CGCCTATCCC
AACACCGATG GAAGTTGGAA CGCCGGGTCC GCCGCAATCT TCGATCTGAG CACTACTGCT
TATCGTCCCT GGGGATGGAC GTCCGCCGAT GCTGCTGGCC TTCCTATCTT CGCCGGCCTC
GTCCGCTACG ACGAAATCGT GAACGGCCAC ATTGATCACG CATTGCGATT CACGCTGCAT
AACAGCAAGC AGGCGATGAT CTCGCCCGCG CGTCACTGGG CTGCGAATTC GTCGGACACC
CTCGCCGCGC CCATGGGCCT GCGTTTCCGT CTCAAAGCCA GCGTGGACAT TTCGAAGTAC
TCCAAGACCA ACCAGATCAT CCTCACCGCG CTGAAGAAGT ACGGCATGAT CATGGCCGAC
AACGGCACGA GCATGTACCT GAGCGGCACA CCCGACGATC GCTGGAGCAA TGACGATCTG
CACAACCTCA CCCAACTCAC CGCTAACGAT TTCGAAGTCA TCAAGCCAAC TGCGGTTTAC
ACGACCCTTC CAACCGGCGC ATCGCCGGTC ATCACCAGTT TCACCGCGTC GGCTTACAGC
ATCACCGCCG GAACCAAAGT GACATTGAGT TGGGCTGCGA CCGGCGCCAC CTACTACAGC
GTCGCGCCAC TGGGCATGCA GCGCGGCACG TGGATGACCG TCGCTCCTAC GAAGACCACC
ACCTATACGC TCTACGCGAC CGGTTCGTAT GGAAGGACGC AGGCGACTTT GACAATCACC
GTTCACTAG
 
Protein sequence
MLRKSLTVIA VCLLPLSASL AQTTPPSCHS AAAGQNGSLA GYVPFGPSSP WNQDVSKATV 
DPNSSAIIGY VGTTKPLHPD FGAGLYQGST MGIPYIVVTS ATPNATIHFT DSPGESDPSP
MPVPKTAPIE GYPAPGSGDR HVLVLNTTTC WLYELYSAYP NTDGSWNAGS AAIFDLSTTA
YRPWGWTSAD AAGLPIFAGL VRYDEIVNGH IDHALRFTLH NSKQAMISPA RHWAANSSDT
LAAPMGLRFR LKASVDISKY SKTNQIILTA LKKYGMIMAD NGTSMYLSGT PDDRWSNDDL
HNLTQLTAND FEVIKPTAVY TTLPTGASPV ITSFTASAYS ITAGTKVTLS WAATGATYYS
VAPLGMQRGT WMTVAPTKTT TYTLYATGSY GRTQATLTIT VH
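
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, not the tool that produced this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons). The `translate` helper is a hypothetical name. It stops at the first stop codon and does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    Codons containing characters other than A/C/G/T are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTTGCGAA AGTCATTAAC CGTTATCGCC"))  # MLRKSLTVIA
```

The output matches the first ten residues of the protein sequence. Translating the final nine bases, `GTTCACTAG`, gives `VH`, which matches the end of the protein.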