Gene Acid345_1246 details

Gene Information

Locus tag: Acid345_1246
Symbol:
ID: 4069821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1522335
End bp: 1523726
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 58%
IMG OID: 637983255
Product: protein translocase subunit secY/sec61 alpha
Protein accession: YP_590322
Protein GI: 94968274
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0201] Preprotein translocase subunit SecY
TIGRFAM ID: [TIGR00967] preprotein translocase, SecY subunit
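
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 1522335
END_BP = 1523726


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # 1392 463
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.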


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.031052
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00314824
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGTTTGAGA AGCTGGCAAA TATCTTCCGG ATCCCGGACC TTCGCAAGCG CGTGTTTTTC 
ACGCTTGCGC TCCTGGCCGT GTATCGCATC GGCGGGCACA TTCCGACGCC GGGCGTGAAC
GCCGATATGC TCCAGCAGTT CTTCCAGAAC AACCGCGGTT CGGTGCTGGG CTTTGTCGAT
CTCTTCAGCG GCGGCCAGCT TCGCCGCTTG ACCATCTTCG CTCTCGGCAT CATGCCGTAC
ATCACGGCGT CGATCATTCT CCAGCTTCTG ACCGTGGTTT ACGAACCGCT GGCGAAGCTG
CAGAAGGAAG GCGAACTCGG GCGCAAGAAG ATCACGCAAT GGACGCGTTA CCTGACCTTT
GGCCTCAGCG CCATGCAGTC GTTCGGTATC GCCGTGACGT TGACGAAGCA GCCGGGCATG
GTCCTGAATC CTGGTTGGGG CTTCATCCTG ATGACGATGC TGACGCTCAC GACCGGCAGC
GTATTCATTA TGTGGCTGGG CGAGCAGATC ACTGAGCGTG GTATCGGCAA CGGCATGTCG
CTGCTGATCT TTGCTGGTAT CGTGGTTGGC CTCCCGCGTG GCGTTGCTGA CCTGGTGGAC
AAGATCAAGA CGCAGTACTG GGGACCGTTC ACGGTTCCGG CGATGTTGCT TCTCATTCTG
GTGATGTTCC TGATCGTTGC GTTCATCGTG TACGTGGAGC GCAGCGAGCG ACGGATTACA
GTGCAGTACG CAAAGCGCAT TGTCGGCCGC AAGATGATGG GCGGAACCTC GACCTTCCTG
CCGCTGCGTG TGAACTCCGG CGGCGTGATG CCGGTGATCT TTGCGTCGTC GATCCTGACC
CTGCCGCAGA CGGTGGGCAT GTTGGGCAGC GTGAGCAAGT ACCACTGGGT GAAGAACCTG
ATGGACCAGC TCAAATGGGG CGAGCCCCTG TACACGATGC TGTACGCGTT GGGCATTGTC
TTCTTCGCAT ATTTCTACGT GTCGATCGTG TTCAACCCGA ACGACGTTGC GGATAATATG
CGCAAGTACG GCGGGTTCAT CCCGGGCATT CGTCCGGGTG CGCGTACCGC AACGTACATC
AACGATATCC TGACCCGCAT TACGCTGGTC GGCGCGTTGT ACCTGATCAT CATCAGCTTT
ATTCCAGAGT GGATGATGGT TGGTTTACAT CTGAACCACT TGCCGTTGTG GTTGGGTGGC
GGACTGTTTG AAAAGCTGCC GACCTGGATG ACCACCGGCC TTGGTGTAAC CTTCTACTTC
GGCGGCACCT CGCTGCTGAT CGTGGTGGGC GTTGCCATGG ACACGGTGCA GCAGATCGAA
TCGCAGCTCA TCATGCGGCA TTACGAGGGC TTCACGCCAC GGAGCGGTCG AATTAAAGGC
CGCCGCTGGT AG
 
Protein sequence
MFEKLANIFR IPDLRKRVFF TLALLAVYRI GGHIPTPGVN ADMLQQFFQN NRGSVLGFVD 
LFSGGQLRRL TIFALGIMPY ITASIILQLL TVVYEPLAKL QKEGELGRKK ITQWTRYLTF
GLSAMQSFGI AVTLTKQPGM VLNPGWGFIL MTMLTLTTGS VFIMWLGEQI TERGIGNGMS
LLIFAGIVVG LPRGVADLVD KIKTQYWGPF TVPAMLLLIL VMFLIVAFIV YVERSERRIT
VQYAKRIVGR KMMGGTSTFL PLRVNSGGVM PVIFASSILT LPQTVGMLGS VSKYHWVKNL
MDQLKWGEPL YTMLYALGIV FFAYFYVSIV FNPNDVADNM RKYGGFIPGI RPGARTATYI
NDILTRITLV GALYLIIISF IPEWMMVGLH LNHLPLWLGG GLFEKLPTWM TTGLGVTFYF
GGTSLLIVVG VAMDTVQQIE SQLIMRHYEG FTPRSGRIKG RRW
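
As a rough check of how the gene and protein sequences relate, the following sketch translates DNA using the amino-acid assignments of translation table 11 and computes a GC fraction. It is a minimal illustration, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and alternative start codons (which table 11 reads as methionine at initiation) are not handled, which does not matter here because the gene starts with ATG.

```python
# Codon table built from the conventional TCAG ordering; the amino-acid
# assignments of translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGTTTGAGA AGCTGGCAAA TATCTTCCGG"))  # MFEKLANIFR
print(translate("CGCCGCTGGT AG"))                     # RRW
```

The two printed fragments correspond to the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.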