Gene Haur_4934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4934 
Symbol 
ID5736770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6263638 
End bp6264957 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content50% 
IMG OID641282101 
Productpreprotein translocase, SecY subunit 
Protein accessionYP_001547692 
Protein GI159901445 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000290796 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCAAG CTGTGGTCAA CGCGCTTAAA ATCGCCGAAG TGCGTAAGAA AATTTTGTTC 
ACCATGATGA TGCTGGTGAT CTTCCGCTTT ATCGCGGCTA TCCCAGTGCC AGGGGTCGAT
ACAGCCGAAC TCGATCGGTT GTTTAATGGT CAGTTGCCGT TGAGCGATTT GGCTAACTTC
CTCAACTTGC TCTCGGGTGG GGCGTTGCGG CAGTTCTCGG TCGCAGCCAT GGGGGTCTAT
CCCTACATCA CGGCTCAAAT TATTATGCAG TTGCTGATTC CGTTGATCCC AGCGCTGGAG
CAATTGAGCA AAGAGGGTGA GCAGGGCCGC AATCGTATTC AGCGCTATCA GTATTTCCTC
ACCGTGCCAC TGGCCTATCT CCAAGGCTAT GGCCAAATTA AATCCTTGAT CAACAGCGGG
ATTAATTTGT TCGGTACGTT GAATTTCAGC ATCACCGAAA ACTTCTTCCA AACCTTCTCG
ATCTTGACGA TTATGGTTTC AGGCTCGATG TTCTTGGTTT GGATGGGTGA GTTGATCGAC
GAACGCGGGA TTGGTAATGG GCTCTCGATG ATTATCTTCG GCGGGATTGT TACGGCTCTG
CCAAGTATGG TTTATCAAGC AATCACCACC TCAAGCTCAG GCAATAACGT CATCGGCGGG
ATTTTGCTGC TGATCGTGAC CTTGGCGACC GTGGTGGGGA TTGTGTTGAT CACCGAAGGC
CAACGCCGCA TTCCAGTTCA ATATGCCAAG CGCGTGCGCG GCAACAAGGT CTATGGCGGT
CAATCAAGCC ACTTGCCATT AAAGGTCAAT ATGGCTGGGA TGATTCCCTT GATCTTCGCC
CAAAGCATTT TGATTTTCCC CAGCACCGTT GCTTCCTACT TCTGGAATCC CAACGGTTCG
GGCTTCTTCA ATGGCGTTGC CAACTTCTTC GTCAATAGCT TTGGCCCGAA TGGCTTGGCC
TTTACCTTGG CGATGTTTGT ATTGGTCTTG CTCTTCACCT ACTTCTATGC CTTGGTGATG
TTCAACCAAC AAAATTTGCC TGAAATGTTG CAACGCAACG GCGGCTTTAT TCCAGGCATT
CGCCCAGGTC GCAACACCGA AGTCTATTTG ACCAAAGTCT TGAACCGCAT CACCTTGATT
GGCGCGGTTG GCTTGGCGGT AGTTGCAGCC TTGCCCTATT TTGTGCAACA AATTACCGGC
TTGCAAATTG GCTTTAGCAG CACCGGTTTG TTGATTGTGG TTGGTGTGGC GGTCGATACC
ATGCGCCAAC TCGAAGCCCA AATGATGATG CGCAATTACG AAGGCTTCAT CTCGCGCTAA
 
Protein sequence
MLQAVVNALK IAEVRKKILF TMMMLVIFRF IAAIPVPGVD TAELDRLFNG QLPLSDLANF 
LNLLSGGALR QFSVAAMGVY PYITAQIIMQ LLIPLIPALE QLSKEGEQGR NRIQRYQYFL
TVPLAYLQGY GQIKSLINSG INLFGTLNFS ITENFFQTFS ILTIMVSGSM FLVWMGELID
ERGIGNGLSM IIFGGIVTAL PSMVYQAITT SSSGNNVIGG ILLLIVTLAT VVGIVLITEG
QRRIPVQYAK RVRGNKVYGG QSSHLPLKVN MAGMIPLIFA QSILIFPSTV ASYFWNPNGS
GFFNGVANFF VNSFGPNGLA FTLAMFVLVL LFTYFYALVM FNQQNLPEML QRNGGFIPGI
RPGRNTEVYL TKVLNRITLI GAVGLAVVAA LPYFVQQITG LQIGFSSTGL LIVVGVAVDT
MRQLEAQMMM RNYEGFISR