Gene Haur_1931 details

Gene Information

Locus tag: Haur_1931
Symbol:
ID: 5733820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2339802
End bp: 2341856
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 53%
IMG OID: 641279075
Product: alpha beta-propellor repeat-containing integrin
Protein accession: YP_001544702
Protein GI: 159898455
COG category:
COG ID:
TIGRFAM ID:
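
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2339802, 2341856)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the result agrees with the listed gene length (2055 bp) and protein length (684 aa).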


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0195615
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACCC CAAAACCTAT CGTTTATGCC CGCATGCTGC TTGGATTGTT ATTGATTAGC 
ATAAGTTTTG GCCTTGGATT GCCGCGAATC GCCGCCCAAA CTCCCCGTGA CGAGGCAACG
GAGCGCTTAA ACTCAGCCGA TTGGGCGGCA ATTCAAGGCC TGCTTGCACC AACTACGGCG
ATCTCTGGCT CCAAATTTCA AGCTGGCTAC TTGAAGGCCG CCCAAGTTTC GGCTGGCGAT
AGCTTTGGCG CGAGTGTGGC AATTTCTGGC GATACGGTGG TGGTCGGCGT TCCAGCCGAG
TCAAGCAGTT TGGCAGGTGT GCAAAATAGT GCTACGCCGA CCATCAATGC TTTGGCTGCC
CAAGCCGGTG CCGCCTATGT GTTTGTACGG GGGAGTGCTG GTTGGCAGCA ACAGGCCTAC
CTCAAAGCCT CTGAGGTGAG CGCCAATGAT CAATTTGGTT GGAGTGTGGC GATCTCCGGC
GATACGATTG TGGTTGGCTC ACCGAGAGAG AGCAGTAGTA CGGTTGGCGT GCAAAATAGT
GCGACACCAA CCGTCAATAA TGATCTTGGT GGGGCTGGCG CGGCCTATGT GTTTGTGCGC
ACTGGCTCAA CTTGGAGTCA ACAAGCGTAC TTCAAAGCCT CACAAGTTAC ATCTGGCGAT
TGGTTTGGCT GGAGCGTGGG TTTAGCATCA AACACGATTG TGGTTGGAGC TTATGGCGAA
GATAGCAATA TCGTTGGCGT GCAAAATAGT GCTACGCCGA CCGTCAATGA AGCGGCTACC
GCAGCGGGTG CGGCCTATAT ATTTGTGCGC ACTGGCTCAA CTTGGAGCCA ACAAGCCTAC
CTTAAAGCTG CCCAAATTAA TACTGATGAT GTGTTTGGGT GGAGTGTTGC AGTAGCTGGC
GATACGGTGA TCGTTGGTGC GCCAGGTGAG GACACCAGCA TTGCAGGAGT GCAACATAGT
GCTACGCCGA CGGTGGATGA AACAGCTCTC GCAGCAGGGG CAGTCTTTGT ATTTACCCGC
AGTGGCTCAA GCTGGAGTCC GCAAGTCTTT GTGAAGGCCT CACAGGTTAC ACCTGGCGAT
CAATTTGGCT ATAATCTGGC GATCGCTGGC AATACGATTG TGGTTGGCGC TCCCTACGAA
GATTCAAGCA CCTCGGGCGT GCAGCATGGC GCTAGCCCAA GTGTCGATGA GCTAGCATCG
TTCGCTGGAG CTGCCTTTGT TTATACCCAA AACGCGGGAG TATGGAGCCA ACAAGCCTAC
CTCAAAGCCT CGAATGTGGC GGATGGTGAT CGCTTTGGCA TGAGTGTGGC GATTGATGCC
AATACGATTG TGGTTGGTGC GCCGGAAGAG GATAGTGGCA TTGCTGGGGT GCAAAATAGT
GCACTACCCA ACGCTGATGA GAGCGCCATC CAAGCAGGAG CCGCCTATGT GTATGCACGG
AATGCTACAA CTTGGAGCCA GCAAAGTTAT CTAAAGGCTT CGCAGGTTTC GACTGGCGAT
TATTTTGGGC GTGGGGTTGG GGTGGCGGGC GATATGCTGG TGATTGGGAT TCCGCTTGAA
GATAGTGCTC GCAGTGGCAT CCAAAATAGT GCTACGCCGA GTGTTGATGA ACTTGCTGCT
GATTCAGGTG CGGCCTTAAT TATTGATACA ACCTATCGTT CGTATAGCCC GTTGGTTCAG
CGCATGACCC TCTTGGCCTT GCTTACGATC AATTCGACGG CTATTCCCAT TCGGGCGGTG
ACGCAACAGG GCGAAGTATT TGCAAGCTTC ACGGCAACAT TGCCTACCAC GATTCCAGCT
GGTGGCCATA TTTATCTTTC GGCTAGCCCT AGCTCGTTGC AACCAACCTT GGTCGATGAT
CGGATTATCA TTCGCGATGG GGCTAACATA ATTTTTCAAC ATACCTACGA TCTTGAGAGC
AATGGCGAGC TTGTTGAGAT TCCGTGGGAA GTTATTAATG CGGCTAGCGG CCATAGCCTG
ACGATCACTT TTGTTGATGT CTCGGCAGGG TTGGTTGGGG CAACGCCGAT CTATCTGATT
TGGGTGGCTG AGTAG
 
Protein sequence
MSTPKPIVYA RMLLGLLLIS ISFGLGLPRI AAQTPRDEAT ERLNSADWAA IQGLLAPTTA 
ISGSKFQAGY LKAAQVSAGD SFGASVAISG DTVVVGVPAE SSSLAGVQNS ATPTINALAA
QAGAAYVFVR GSAGWQQQAY LKASEVSAND QFGWSVAISG DTIVVGSPRE SSSTVGVQNS
ATPTVNNDLG GAGAAYVFVR TGSTWSQQAY FKASQVTSGD WFGWSVGLAS NTIVVGAYGE
DSNIVGVQNS ATPTVNEAAT AAGAAYIFVR TGSTWSQQAY LKAAQINTDD VFGWSVAVAG
DTVIVGAPGE DTSIAGVQHS ATPTVDETAL AAGAVFVFTR SGSSWSPQVF VKASQVTPGD
QFGYNLAIAG NTIVVGAPYE DSSTSGVQHG ASPSVDELAS FAGAAFVYTQ NAGVWSQQAY
LKASNVADGD RFGMSVAIDA NTIVVGAPEE DSGIAGVQNS ALPNADESAI QAGAAYVYAR
NATTWSQQSY LKASQVSTGD YFGRGVGVAG DMLVIGIPLE DSARSGIQNS ATPSVDELAA
DSGAALIIDT TYRSYSPLVQ RMTLLALLTI NSTAIPIRAV TQQGEVFASF TATLPTTIPA
GGHIYLSASP SSLQPTLVDD RIIIRDGANI IFQHTYDLES NGELVEIPWE VINAASGHSL
TITFVDVSAG LVGATPIYLI WVAE
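
The gene and protein sequences above are printed in space-separated groups of ten, so they need cleaning before any comparison. The sketch below is illustrative only and not part of the source record. It strips the grouping, translates a CDS codon by codon, and computes a GC fraction. It assumes the standard codon assignments, which NCBI translation table 11 shares for elongation codons (table 11 differs mainly in its permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGAGTACCC CAAAACCTAT CGTTTATGCC CGCATGCTGC TTGGATTGTT ATTGATTAGC"
print(translate(clean(first_line)))
```

Run on the first line of the gene sequence, this yields the first 20 residues of the listed protein sequence (MSTPKPIVYA RMLLGLLLIS). Passing the full cleaned gene sequence to `gc_fraction` would allow a comparison against the GC content given in the record.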