Gene Haur_1865 details

Gene Information

Locus tag: Haur_1865
Symbol:
ID: 5733754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2199307
End bp: 2201034
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 53%
IMG OID: 641279009
Product: alpha beta-propellor repeat-containing integrin
Protein accession: YP_001544636
Protein GI: 159898389
COG category:
COG ID:
TIGRFAM ID:
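
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, which is consistent with the listed values but is not stated in the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2199307, 2201034)
print(length_bp)                  # 1728
print(protein_length(length_bp))  # 575
```

Both results agree with the Gene Length and Protein Length fields of the record.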


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGGT CAGGATTCGC ACCGCGTTGG TTTCGACGGA TCGTTTGGAT GGTTTTGCTC 
GTGCAAGTAG TCGTGATCGC TGCAACCATC AGCAGTTCAC CAACTTCTGT CATGGCAACA
GCCCCTTTGT CCTCGGCAGA CTGGGCCGCA ATCCGTACCC TATTGCCATT GGAGCAACAA
GCCTACTTTA AACCGTCGCA GGTCTCAGAA ACAGATTATT TTGGCCTGAG TGTGGCCATG
TCGGGCGACA CCGTGGTCGT CGGTGCGCCA TATGAGAATA GCAGCACGGC AGGAGTCCAG
AATAGTGCCA CCCCAACCGT CGATGAATTA GCACCTGAAT CGGGGGCTGC CTATGTCTTT
GTACGCAATG GGCCAACTTG GAGTCAACAG GCCTATCTCA AAGCCTCACA AGTTTCGATA
GCAGACCTTT TTGGCTCGAG TGTGGCAGTG TCGGGCGATA CGATTGTGGT AAATGCGCTC
TATGAAGATA GCAGCACAAC AGGAGTTCAA AATAGTGCCA CTCCGACCGT TGATGAAGCT
GCGGATAGTG CTGGTGCAGC CTATGTCTTT GTGCGCAACG GGACAACTTG GAGTCAACAG
GCTTATCTCA AGGCCTCGCA AGTCTCAGAA ACAGACTTTT TTGGCGGGAC TGTGGCAGTA
TCGGGTGACA CCATTGTAGT GGGTGCACAC TTTGAGGATA GCAGCACGGC GGGAGTTCAA
AATAGTGCTA CTCCAACCGT TAATGAAGCA GCAGACAATG CTGGTGCAGC CTATGTCTTT
GTGCGCAATG GGGCAACCTG GAGTCAACAG GCCTATCTCA AAGCTTCACA GGTCACTGGA
TTATCCCCCT TTGAAGAAGG CCGCTTTGGC TGGAGCGTGG CGGTGTCGGG CGATACGATT
GTGGTTGGTG CGACCTATGA GGATAGTACT ACGGCAGGAG TCCAGAATAG TGCCACCCCA
ACCGTCGATG AATTAGCACC TGAATCGGGG GCTGCCTATG TTTTTGTGCG CAACGGGACA
ACCTGGACGC AACAGGCTTA TCTCAAGGCC TCAAATGTCT CAATGTATAA TTACTTTGGC
AGGAGCGTGG CGGTGTCGGG CGATACGATT GTGGTTGGTG CGCCCTATGA GCGTAGCAAC
ATAGCAGGAG TGCAAAACAG TGCCACACCG ACCGTGGATG AAAGCGCATT TACATTTATG
GCGGGAGCGG CGTATGTGTT TGTGCGCAAT GGAACTCAGT GGAGCCAACA GGCTTACCTA
AAGGCTTCCC AAGTGTCGAG TTCTGACGGC TTTGGCTGGA GCGTGGCGGT ATCGGGCGAC
ACTGTGGTTG TCGGGATACC CAATGAGGAT AGTGATACGG CAGGTGTTCA GCAGAGCGAC
ACCCCCGTGG TGAATGAAGA TGCGAGCGAT TCTGGGGCAG TAGTGGTGAT GGTACGCAAC
GGGACAACAT GGAGCCAGCA GGCCTATCTC AAAGCCTCAA ATGTCTCGTC ATTTGATGTG
TTTGGGAACG CCGTTGCGGT TGCTGGTGAT ACCGTGATCG TGGGTGCACC TTTCGAGAAT
GGTAGTATCG CGGGCATTCA GTATGGTTCC AGCCTCGTAG TGGATGACGA TGTGATCGAT
GCTGGAGCCA TCTATAGTTT CAGGCTCCCT GTCGTTGATC CATATTTGAT GTATCTGCCG
TTTGTGGCAA CCAGTGATCG TGCCGCAGCG CAGACCGCCG CGCAATAG
 
Protein sequence
MDRSGFAPRW FRRIVWMVLL VQVVVIAATI SSSPTSVMAT APLSSADWAA IRTLLPLEQQ 
AYFKPSQVSE TDYFGLSVAM SGDTVVVGAP YENSSTAGVQ NSATPTVDEL APESGAAYVF
VRNGPTWSQQ AYLKASQVSI ADLFGSSVAV SGDTIVVNAL YEDSSTTGVQ NSATPTVDEA
ADSAGAAYVF VRNGTTWSQQ AYLKASQVSE TDFFGGTVAV SGDTIVVGAH FEDSSTAGVQ
NSATPTVNEA ADNAGAAYVF VRNGATWSQQ AYLKASQVTG LSPFEEGRFG WSVAVSGDTI
VVGATYEDST TAGVQNSATP TVDELAPESG AAYVFVRNGT TWTQQAYLKA SNVSMYNYFG
RSVAVSGDTI VVGAPYERSN IAGVQNSATP TVDESAFTFM AGAAYVFVRN GTQWSQQAYL
KASQVSSSDG FGWSVAVSGD TVVVGIPNED SDTAGVQQSD TPVVNEDASD SGAVVVMVRN
GTTWSQQAYL KASNVSSFDV FGNAVAVAGD TVIVGAPFEN GSIAGIQYGS SLVVDDDVID
AGAIYSFRLP VVDPYLMYLP FVATSDRAAA QTAAQ
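
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched as follows. This is a minimal example under stated assumptions, not the method the database used: the helper names are hypothetical, whitespace in the 10-base blocks is simply stripped, and the codon table is the standard one. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same codon-to-residue assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base until a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGGATCGGT CAGGATTCGC ACCGCGTTGG"
print(translate(first_bases))  # MDRSGFAPRW
```

The ten residues printed match the start of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the complete protein sequence and the GC content field to be compared in the same way.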