Gene Haur_5100 details

Gene Information

Locus tag: Haur_5100
Symbol:
ID: 5737058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 128914
End bp: 130560
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 52%
IMG OID: 641282265
Product: alpha beta-propellor repeat-containing integrin
Protein accession: YP_001547856
Protein GI: 159901610
COG category:
COG ID:
TIGRFAM ID:
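
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes 1-based, inclusive coordinates and a coding sequence whose final codon is a stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(128914, 130560)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1647 548
```

Under these assumptions the computed values agree with the listed gene length (1647 bp) and protein length (548 aa).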


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.908648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGATC ATACCCCTCA TCAGAAACCC TTTGTATCGG ATAGCCCGCC CTCCAATCGA 
TGGCGTGCAT TCCGCCATTG TGTTGGAATC AGCATGTTGT TTGTACTGAT TCAGGGATCG
GTTCTTGTCT ATACGAGAGG ATCGCTACCG GTAGAACACG CAATCGCGCA ATCAGCCCAA
CACCCGCATG GATCGCCGAA CCTCCTTCCT ACCTATATTA AGGCATCGAA TACCGATCCT
CACGATGCGT TCGGCTTTCG CGTAGCGCTT GATGCTACCA CACTGGCAGT AAGCGCCCCA
TACGAATCAA GTGCCGCAAC GGGTATTCAG GGTGACCAAT CAAATAATAT GGCGCTCCAA
TCAGGAGCCG TGTATATTTT TGTCCGGGAT GGCGATACAT GGGTCCAACA AGCGTATCTC
AAAGCGTCCA ATACCGACGC CGGCGATGGC TTTGGGGTCA GTCTTGCCCT CGATGGGGAT
ACGCTTGTGG TTGGGGCGTA TGCTGAGGAC AGTGCTGCCA CCGGAATCAA CGGCAATCAG
GCCGATAATT CCGCTGCGAA CGCGGGGGCG GCCTATGTCT TTGTCCGATC AGGGTCAACC
TGGAGTCAGC AAGCCTATCT GAAAGCATCC AATACTGATG AAGGCGATGG GTTTGGCTAT
AGGGTTGCGA TTGATGCAAC CACAGTCGTG ATTAGCGCCC GTGGCGAAGA TAGTGGAGCA
ATGGGGGTAA ATAATGATCA GGCGAACAAT GATAAAGTGG ACGCGGGGGC GGCCTATGTT
TTTGTCCGAT CAGGTTCAAC TTGGAGTCAG CAAGCCTATC TCAAAGCATC CAATACCGAT
GCAGACGATG GGTTTGGCTA TAGTGTATCA ATTGAGAACC AGCTGATCGC CGTTGGCGCG
AATGGGGAGG ATGGGAGTAC GACTGGCGTA AATGGAGGGC AGGACGATAA TACTGCTCCG
GACGCAGGGG CGGCCTATGT CTTTGTCCGA TCGGGTTCAA CTTGGAGTCA GCAAGCCTAT
CTCAAAGCAT CCAATACCGA TGCAGACGAT GGGTTTGGAC AGCGTGTTCA GCTTGCAGGA
TCAACGGTAG TGGTGAGTGC CGTTCGGGAA GATAGCGCCG CCACCGGAGT CAATGGCAAT
CAGCATGATA ATACTGCCAT GGATGCAGGA GCGGCTTATG TCTTTGTTCA GAATGGGAAT
ACGTGGAGTC AACAAGCCTA TCTAAAGGCC TCAAATACTA ACGCAGGCGA TGGGTTTGGC
TATAATCTCC ATGCGTTGGG TGATTGGATA CTGATTGGCG CACCATATGA GGCGAGTGCG
GCCACAATCA TCAACGGGAA TCAGCATGAT AATAATGCCA ACCGTGCAGG AGCCGCCTAT
CTTTTTGCGC GGCAACAGAC ACTATGGAGT CAGTCCGCCT ATCTGAAAGC CATGAATACC
GATTCAGGCG ATCTCTTTGG GAATACTATG GGCATGAATG AGTCACTCAT CATCGTTGGA
GCGTCAAATG AAGATAGCAA TACCCTCGGG ATCAATGGCG ATCATGCGAA TAATCTAGCC
CTTAATTCAG GCGCAGTCTA TAGTTTCCCA TTTGCCATGA TTCCTTCCAT ACGGGCGTAT
CTCCCATTGA CCACCCGGGG TGAATAG
 
Protein sequence
MDDHTPHQKP FVSDSPPSNR WRAFRHCVGI SMLFVLIQGS VLVYTRGSLP VEHAIAQSAQ 
HPHGSPNLLP TYIKASNTDP HDAFGFRVAL DATTLAVSAP YESSAATGIQ GDQSNNMALQ
SGAVYIFVRD GDTWVQQAYL KASNTDAGDG FGVSLALDGD TLVVGAYAED SAATGINGNQ
ADNSAANAGA AYVFVRSGST WSQQAYLKAS NTDEGDGFGY RVAIDATTVV ISARGEDSGA
MGVNNDQANN DKVDAGAAYV FVRSGSTWSQ QAYLKASNTD ADDGFGYSVS IENQLIAVGA
NGEDGSTTGV NGGQDDNTAP DAGAAYVFVR SGSTWSQQAY LKASNTDADD GFGQRVQLAG
STVVVSAVRE DSAATGVNGN QHDNTAMDAG AAYVFVQNGN TWSQQAYLKA SNTNAGDGFG
YNLHALGDWI LIGAPYEASA ATIINGNQHD NNANRAGAAY LFARQQTLWS QSAYLKAMNT
DSGDLFGNTM GMNESLIIVG ASNEDSNTLG INGDHANNLA LNSGAVYSFP FAMIPSIRAY
LPLTTRGE
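
The sequences above are displayed in space-separated blocks of 10 residues, so the grouping has to be stripped before any downstream analysis. The sketch below is illustrative only and not part of the original record. It uses hypothetical helper names to remove the grouping whitespace and to compute a GC fraction, which could be applied to the full gene sequence and compared with the listed GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence as displayed above.
first_line = "ATGGACGATC ATACCCCTCA TCAGAAACCC TTTGTATCGG ATAGCCCGCC CTCCAATCGA"
last_line = "CTCCCATTGA CCACCCGGGG TGAATAG"

print(len(clean_sequence(first_line)))  # 60
print(clean_sequence(last_line)[-3:])   # TAG
```

The final three bases, TAG, are a stop codon in translation table 11, consistent with the protein sequence ending at the preceding glutamate (E).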