Gene Acid345_3904 details


Gene Information

Locus tag: Acid345_3904
Symbol:
ID: 4072241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4618545
End bp: 4620260
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 62%
IMG OID: 637985930
Product: Sel1
Protein accession: YP_592978
Protein GI: 94970930
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
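
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4618545, 4620260)
print(length, protein_length(length))  # prints: 1716 571
```

Under those assumptions, 4620260 - 4618545 + 1 = 1716 bp, and 1716 / 3 - 1 = 571 aa, matching the Gene Length and Protein Length fields.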


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0190521
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00495215
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATCTGCC CGAAGTGCCA GTCTGAAAAT CCTGAATTGA ACCGCTTCTG TGGAGCGTGC 
GGAAGCCGTT TGCAGGAGAC CGCTCCGGTA GATCAGGCCC GTCCGAGCAA CGGCAACGAT
GCGAACAAAC CCGGAGTTCC GGGAGTCCGT CGTCCGTTCA TCGTTTCAAC CACAGCCATG
GCGGACGTCA CAGCGCAGAT GCTGAACACG CGCGTCGTAC TGCCGTCACC ATCGGTTGCA
AGCGCAACGC ATTCTGGATA CGTGGCTCCG CTGAAGCCGC GCGTGGAACA CATTGCGCAC
GGCGAAGACC TGCCGGAGCC CGAAATCGCG CCGGTTGATC CCAATGATGA ACCGATGTTC
GCCGAGGGCT CCGTAGAACA GCAAGAAGGT ACGGTTCACG AATTCGATCT GGATTCCCCG
GAAGAGAAGG AAGCCGAAGA ATGGCTGGAG CGGACGGTCG CCGAGCACGA AGCGCACATG
CCCCCGCCGC GAACAGAGAC CCCGGGCTCC ATCTTGAATC TGAGCGCGCC GGTTGAATCG
GTGGCGTCTG AGCGGCTCGA AGAGCCGCCC GTAGAGCAAG AGCCGGTTCG CAATTCGTTT
CTTCAATTCG ACCCGCCGTC GGAATCGACT GGCGGCAGCG TATCCGGGCC ATCGTTCCTG
GGATTGGATG AGCCACCGTC GCAGGACTAT CTTCTCGAGG AATCTGGATC CCACACGGGA
AGGAATCTCG TGTTGGTGGC GATCGTGGCC ATCGTCGCCG CGATGGGCTA CCTCGAGTGG
CGTGCGAGTA GCCGTGGTGA ATCCACTAAT CCGGTGGATG TACTGCACTT GAAACTCCCG
AAGAAGAAGG GGCAGGGTCC GGCCGAGGTT GCAACGTCGA CGACGACCTC GCCGGGCGGC
TCTTCGACTA CCGAGAGCGC CAACAATTCC GGCAAACCTG ATCTGATAGC GGAGCCGAAC
CAGCCAGCGG CACAGAGTAG CGCTGCCGCG GGGAATTCTC AGACTTCCGC TACGCCGGCG
CCGAACGCGA ATCCAGGAAC TGCGGAAGCG AATCCGCCTT CGACCGCCGC GGCAACGACG
AATGCGGCAG GTACCTCTTC TCCGCAGCCC GCTGCAGCCG CAACGAAATC GACACCCCCA
CCGGTCGAGA AACAGACGAC CGAAGTCGCA AAGAATACGC CGCCGCCCGC GAAGAAACCC
GAGCCTCTAC CGCAGTCTGA CGCGGCCACC GCGAAGCCGA CCGCCAGTAA GCCGGCGGCA
GCGATCGCAA GCAAGCCCCC GACGCCTGCT GCCCAGACGC AAGAAACTGA TCCGACTCTG
AATGCCGGCG GTGCCGAGCT GCAGAAAGGC AAAGCTGCAG GCGCAACCGA CGATGGCCGC
ATGTGGCTCT GGAAAGCTGT GGCGAAGGGC AACGGCGAAG CTCCTGTACT TCTGGCGGAC
ATGTATCTGC AAGGCAGAGG CGTCCCGAAA GATTGCGAGC AGGCGATGCT GCTGCTCAAC
GCTGCCGCGA AGAAGGCGAA TCCTCGCGCA CGTTCGAGAC TTGGCTCGTT GTATGCCACT
GGCGAGTGCG TTTCCCAGGA TCGGGTGCAG GCTTATAAGT GGATGACCTC GGCACTCGCC
GCGAATCCGG GAAGTGATTG GATCGAAAAG AACCGCCAGC AACTTCTGAG CCAGATGACG
GCGTCGGAGC GCAAGCGCGC CGCTGCAATT CAGTAG
 
Protein sequence
MICPKCQSEN PELNRFCGAC GSRLQETAPV DQARPSNGND ANKPGVPGVR RPFIVSTTAM 
ADVTAQMLNT RVVLPSPSVA SATHSGYVAP LKPRVEHIAH GEDLPEPEIA PVDPNDEPMF
AEGSVEQQEG TVHEFDLDSP EEKEAEEWLE RTVAEHEAHM PPPRTETPGS ILNLSAPVES
VASERLEEPP VEQEPVRNSF LQFDPPSEST GGSVSGPSFL GLDEPPSQDY LLEESGSHTG
RNLVLVAIVA IVAAMGYLEW RASSRGESTN PVDVLHLKLP KKKGQGPAEV ATSTTTSPGG
SSTTESANNS GKPDLIAEPN QPAAQSSAAA GNSQTSATPA PNANPGTAEA NPPSTAAATT
NAAGTSSPQP AAAATKSTPP PVEKQTTEVA KNTPPPAKKP EPLPQSDAAT AKPTASKPAA
AIASKPPTPA AQTQETDPTL NAGGAELQKG KAAGATDDGR MWLWKAVAKG NGEAPVLLAD
MYLQGRGVPK DCEQAMLLLN AAAKKANPRA RSRLGSLYAT GECVSQDRVQ AYKWMTSALA
ANPGSDWIEK NRQQLLSQMT ASERKRAAAI Q
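
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any calculation such as GC content. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used by the database to report "62%" is not known, so the sketch makes no claim about reproducing that exact figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first two blocks of the gene sequence only, not the full gene.
first_blocks = "ATGATCTGCC CGAAGTGCCA"
print(len(clean_sequence(first_blocks)), gc_percent(first_blocks))
```

To apply this to the whole gene, pass the full text of the "Gene sequence" block to `gc_percent`; the cleaned length can also be compared with the Gene Length field as a sanity check.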