Gene Haur_5059 details

Gene Information

Locus tag: Haur_5059
Symbol:
ID: 5737017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 73378
End bp: 74598
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 52%
IMG OID: 641282224
Product: alpha beta-propellor repeat-containing integrin
Protein accession: YP_001547815
Protein GI: 159901569
COG category:
COG ID:
TIGRFAM ID:
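
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that check follows; it assumes 1-based, inclusive coordinates and a CDS ending in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(73378, 74598)
print(length, protein_length(length))  # 1221 406
```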


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.241627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTTACC AACCTATTGG CTTAGTGTGC ACCGGGGATA CACTGGTCGT GGGGGCACAG 
GCTGAAGATA GCGCGGCCAC CGGAATCAAC GGCAATCAGG CTGATAATAC TGCGCCCGAT
ACCGGCGCTG CCTATGTCTT TGTCCGATCT GGCACAACCT GGACCCAACA AGCCTATCTG
AAAGCATCCA ACGCCGAGGC TGGAGATGGC TTTGGCTTCA GTCTTGCGAG TGATACCAAT
CGAATTCTGG TTGGAGCACC CTTTGAGGAT AGCAATGCCT CGGGCAGTAA TAATGGTATC
GGCGAACAGA ATAATGATCT TCCGGGCGCA GGGGCAGCCT ATCTTTTTCA CCAGTTCAAT
GGACTATGGA CGCAAGAGGC ATATCTTAAA ACCAATCATC GTACAAAAGA TGAAGCCTTT
GGTCACGCCG TGGCCATGGA GGAAGGAACC ATTGTTATCG GATCACCCTA CGCAGATGGG
TATGAGGCCG TAAAAACAGG ACTGATCACC GTCTTTGTTT ATCAAGGAGC AGGAGTAGGA
TGGCATCACA GCCAAACCAT GGGAAGTCCA GGCCCAAATA CGGGCGATGG ATTTGGACAA
TCGGTCGCGA TTACCAATCA GCGCATCGCG GTAGGAGCCT ATGGCGAAGA TAGTAATGCG
ACCCTGATTA ATGGAGATAG CAGCAATAAT ACGGCAGCAA ATGCTGGGGC AGCCTATATC
TATGATCGCC ATCCAACCTT TTATGAACAT GTGTGGCATC CAACGACCTA TATCAAGGCA
TCGAATACGG ATGCCGTGGA TATTTTTGGC CGGAATCTTG CCTTCTGTGG CCCAACCTTG
CTCGTCGGAG CACCCTATGA AGATAGTGCC GCACAAGGCA CCAATGGCAA CCAAACCAAT
AATAGTCTCG CGAGTGCGGG GGCCGTCTAT CGCTATATCT GGGATGGCTC GCAGTGGCAG
CATCGGCATT ATAACAAAGC CCTCAATCCT GATGCGCTTG ATTATTTTGG CATGAGGCTC
GCCTGTCATG ATCAACTCCT TGCCGTTAGC GCCCCGGGTG AAGATAGCGC TGCCCAAGGA
GTCAATGGCG ATCAAACAGA TAATTCGGCA CTCGATGCTG GTGCGGTGTA TGTCCTCAGC
CTGCCGATGC AAGGCTACAC CCACCTTCCT GCGGTGACCG GTGAAGAAAT CACATCGCCC
TATCCGCTGC CGCAACGCTA A
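
The gene sequence is printed in space-separated blocks of 10 bases. A minimal sketch, assuming the record has been copied into a string, shows one way to strip that layout and compute a GC percentage; `clean_sequence` and `gc_content` are hypothetical helper names, not functions of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, as formatted in this record.
first_line = "ATGTGTTACC AACCTATTGG CTTAGTGTGC ACCGGGGATA CACTGGTCGT GGGGGCACAG "
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Applying the same two calls to the full sequence text gives a figure that can be compared with the GC content reported in the Gene Information section.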
 
Protein sequence
MCYQPIGLVC TGDTLVVGAQ AEDSAATGIN GNQADNTAPD TGAAYVFVRS GTTWTQQAYL 
KASNAEAGDG FGFSLASDTN RILVGAPFED SNASGSNNGI GEQNNDLPGA GAAYLFHQFN
GLWTQEAYLK TNHRTKDEAF GHAVAMEEGT IVIGSPYADG YEAVKTGLIT VFVYQGAGVG
WHHSQTMGSP GPNTGDGFGQ SVAITNQRIA VGAYGEDSNA TLINGDSSNN TAANAGAAYI
YDRHPTFYEH VWHPTTYIKA SNTDAVDIFG RNLAFCGPTL LVGAPYEDSA AQGTNGNQTN
NSLASAGAVY RYIWDGSQWQ HRHYNKALNP DALDYFGMRL ACHDQLLAVS APGEDSAAQG
VNGDQTDNSA LDAGAVYVLS LPMQGYTHLP AVTGEEITSP YPLPQR
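
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a simplified illustration, not the database's own pipeline: it uses the amino-acid assignments that translation table 11 shares with the standard code, stops at the first stop codon, and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTGTTACCAACCTATTGGCTTAGTGTGC"))  # MCYQPIGLVC
```

The output matches the first ten residues of the protein sequence listed above.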