Gene Haur_5142 details


Gene Information

Locus tag: Haur_5142
Symbol:
ID: 5737100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 193484
End bp: 195073
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 50%
IMG OID: 641282307
Product: alpha beta-propellor repeat-containing integrin
Protein accession: YP_001547898
Protein GI: 159901652
COG category:
COG ID:
TIGRFAM ID:
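
The start, end, gene length, and protein length fields above agree with one another. The sketch below checks that arithmetic. It assumes 1-based inclusive coordinates and a gene length that includes a single terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(193484, 195073)
print(length, protein_length(length))  # prints: 1590 529
```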


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0225115
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGTCC ACTTCATCAA AAAACGCAAT CCCATGGTCC TTGTTGTGGT CTTTATTGGT 
ATCTGGTTGT TGACGATGAT GCCCCCAACA CGTCATACTG CTGCGATACC GCATGGTCTT
ACCACCCATG ATTGGCAGAT CCTCACATCG TTCTTTCCCC CAACTCAGCA AGCATTTTTT
AAGGCATCCA ATACCGGCCC ACAAGATTTT TTTGGATCTC GTGTTGCTGT AGACGATGAT
ACCGTCGTCA TAACGGCCCC GCAAGAAGAT AGTAGTACAA CCGGGGTCAA TAGTACACCC
AATGAGGAGG CATCGGAATC GGGGGCAGCC TATGTCTTTG TTCGGACGAA TGGGCTTTGG
ACACAACAAG CCTATCTCAA ACCCTCCAAT ACGAGTCCCG GTGATGCGTT TGGAGAAAGT
GTCGCCATTG ATCAGGATAC CATTGTTATT GGTGCCTTTA AGGAAGATAG TAGTACGACC
GGGGTCAATA GTATGCCCAA TGAGGAGGCA TTAAACGCAG GCGCAGCCTA TGTCTTTGTT
CGGACGAATG GGCTTTGGAC GCAACAAGCC TACCTCAAGG CATCGAATAC GGATGCGGAT
GATGCGTTCG GGACAGCCGT AAGTGTTAGC CAGGATAGTA TCGTTGTCAG TGCGATCAAT
GAAGATAGCA GTACGACCGG GGTCAATAGT ACGCCCAATG AGGAGGCATT AAACGCAGGC
GCAGCCTATG TCTTTGTTCG GACGAATGAG CTTTGGACGC AACAAGCCTA CTTTAAAGCA
TCGAATACGG GTGCGGATGA CATGTTCGGA CGAAGCGTTG CCCTTTATGC CACAACCCTC
GTTGTAGGTG CCTATCTTGA GGATAGTAAT ACCCGCGGTG TCAACAATCC GCCCAATGAA
GCAGCACCTG ATGCGGGTGC AGCCTATGTC TTTGAACGGG TCAATGGTCA TTGGGTGCAG
CAGGCCTATC TCAAGGCATC AAATCCAGGG GAAACTGATC GCTTCGGCAT TAGTGTTGCG
CTTGAGAGCA GCACGATTGT GGTAGGGGCG TATCTGGAAG ATAGCAGTAC AGCGGGTGGG
CAGAGCGACC CAAATGAGCA AGCACCCGAT GCGGGTGCAG CCTATGTCTT TGTCCGAATC
ATGGATACAT GGTATCCGCA AGCCTATCTT AAAGCGTCAA ACATCGATGC AGGAGACCGT
TTTGGAGCCA GTGTCGCCGT CCATGGCGAT TTACTGCTGA TTGGAGCGTA CTTGGAAGCA
AGCAGTAGTA GTGGCATCGA CAGTATCCCA AATAATGATG CACCCGGAGC CGGAGCCGCC
TATCTTTTTC TACGAACCAA TCATAGATGG ACACAACAAT CCTACTTAAA AGCATCGAAT
ACCGGACTGA ATGATACATT TGGCATTCGT GGTGCGCTAT ATCAAGGGAC AATCGTGATC
GGTGCATATC AGGAAGATAG TAGTACGGTT GGGGTGAATC CTCCTTCTAA TGAGGAGGCT
TCAGATTCCG GTGCAGCCTA TGTCTACACG ACGACTATGC TTGTGCCGAA TCCCCATCGT
GCATATCTGC CATGGGCTGG ACGGGAATAG
 
Protein sequence
MHVHFIKKRN PMVLVVVFIG IWLLTMMPPT RHTAAIPHGL TTHDWQILTS FFPPTQQAFF 
KASNTGPQDF FGSRVAVDDD TVVITAPQED SSTTGVNSTP NEEASESGAA YVFVRTNGLW
TQQAYLKPSN TSPGDAFGES VAIDQDTIVI GAFKEDSSTT GVNSMPNEEA LNAGAAYVFV
RTNGLWTQQA YLKASNTDAD DAFGTAVSVS QDSIVVSAIN EDSSTTGVNS TPNEEALNAG
AAYVFVRTNE LWTQQAYFKA SNTGADDMFG RSVALYATTL VVGAYLEDSN TRGVNNPPNE
AAPDAGAAYV FERVNGHWVQ QAYLKASNPG ETDRFGISVA LESSTIVVGA YLEDSSTAGG
QSDPNEQAPD AGAAYVFVRI MDTWYPQAYL KASNIDAGDR FGASVAVHGD LLLIGAYLEA
SSSSGIDSIP NNDAPGAGAA YLFLRTNHRW TQQSYLKASN TGLNDTFGIR GALYQGTIVI
GAYQEDSSTV GVNPPSNEEA SDSGAAYVYT TTMLVPNPHR AYLPWAGRE
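
Both sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any downstream use. The following is a minimal sketch of one way to do that and to recompute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is pasted in as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, as sample input only.
first_line = "ATGCACGTCC ACTTCATCAA AAAACGCAAT CCCATGGTCC TTGTTGTGGT CTTTATTGGT "
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
```

To check the whole gene, paste all lines of the gene sequence into one string, pass it through `clean_sequence`, and compare `len(...)` with the Gene Length field and `gc_percent(...)` with the GC content field.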