Gene Haur_2403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2403 
Symbol 
ID5734284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3061786 
End bp3063765 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content48% 
IMG OID641279544 
Productalpha beta-propellor repeat-containing integrin 
Protein accessionYP_001545171 
Protein GI159898924 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000436928 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTC ATTCTGATCA AACGCTACCG ACTATGCGAG TTGCTCGCCG AATTATTGCT 
GGATTATTGT TGTTGGCTGT GGTTATTGGG TTGGCGCGAG TTGGGCGGTT TGTCCATGCC
CAACCAGCCC AATCAGCGAG CCTCCACGCA AGCGATTGGC AAGCTATTAG CGAACTGTTA
CCCCCCAACC AACAAGCCTA TCTCAAGGCC TCGAATACCA ATCTTGGCGA TCTGTTTAGC
TCAAGCGTGG CGATCGATGG CAATACAATT GCGATTGGTG CGCCCAATGA ATCAAGTTCG
GCAACTGGAA TTAATGGCAA TCAACAGGAT AATAGTGTCA TTAGTTCAGG TGCAGTTTAT
ATTTTTGTGC GTACTGGGAC CACCTGGAGC CAGCAAGCCT ATATTAAAGC CTCAAATCCC
GATTTCAATG ATCTATTCGG TCATAGCGTG GCATTGTCTG GTAATACCTT GGTGGTTGGG
GCGGTCAATG AATCAAGTGA GGCCACCGGA ATTAATGGCA ACCAAACCGA TAACAGCGCG
ATGAATGCTG GGGCCGTTTA TGTGTTTGTG CGCAGTGGCA CGACTTGGAG CCAGCAATCC
TATCTCAAAG CTTCAAATGC TGAAGCCTTT GATCAATTTG GCTGGATTGT CGCGCTTGAT
GGCAATACCT TAGCGGTTGG GGCTAATCTT GAATCGAGCA ATGCGACTGG AGTTAATGGC
AACCAAGCCG ATAATAATGC TGTCCGTTCA GGAGCAGCCT ATATATTTGT GCGCACTGGC
ACGACCTGGA GCCAACAAGC CTATCTCAAA GCCTCAAATA CCGAGGCCAA CGATAATTTT
GCGATGGCAC TTGACCTAAG CGGCGATCGC TTGGTGGTTG GGGCGGTCAA TGAAGATAGT
GCTGCCACTG GCATTAATGG TGATCAATCC AATAATGATG CAGCTAGTGC TGGCGCAGCC
TATGTTTTTG TGCGCAGTGG CACGACCTGG AGTCAGCAAG CTTACCTCAA AGCCTCAAAT
ACCGAGGCCA ACGATTTCTT CGGCGAGAGC GTGACGATCG ATGACTCAAC CGTAGCAGTT
GGCGCATGGT GGGAAGATAG TTCGGCTACG GGCGTTAATG GTGACCAAAA TAATAATAAT
ACGACCTTTT CAGGAGCAGC CTATGTCTAT AGCTTTGATG GTATGAGTTG GAGCCAGCAA
GCCTATATTA AAGCCTCAAA TACTGATACT GAAAATTATT TTGGTCATGC ATTGGTGTTG
CGTGGCGATC GGCTGATTGT TAGCGCCTAT GCTGACGATA GTGCGGCCAT TGGCATCAAT
GGTGATCAAC AGAATGCTGA TGCTGGCGGT TCTGGAGCGG CTTTTGTCTT TGCACGAGTT
GGTACGGTGT GGAGCCAGCA ACACTATTTG AAAGCCTCGA ATACTGGAGT TGAAGATACT
TTTGGCTATA CCATGGCAAC CGATGGTTTA AGTTTGGTGG TTGGTGCAAG GTATGAAGAT
AGCAATGCAA CCGGAATAGA TGGCAACCAA GCTGATAATA GCGCCGATTT GTCTGGTGCG
GCCTATGTGT TTAGCCTAGC ACAATCGGTT GCCTATTTAC CATTGGTCTT TAAACGGATG
ACGACCCTGA TTGCTACGAT TAATCCTAAT ACGATTCCGA TTCGCCCGAT CACTGTTCAA
GGCGAAACCT TTTTGAGCTC AAGCTTTATA TTGCCCAGTG ATTTGCCTGC AACTGGTACG
TATTATCTTT CGGCGAGTCC GACGAGCGTT ATGCCGAGCT TAGTTGATGA TGCGGTGGTA
TTGTCTGCCA ACAGCACCCA GATTTTTCGC CATGAATATT CAACCCCTAA TTCAGCGATT
GTAACTGTGC CCTATGCGAC CCTCGCTCCG TATGCTGGCC AATCAATCAC TGTTCAATTT
AATGATGTTT ATGGCAGCGT GGTTCAGGCC TCGCCAATGT ATCTGATTTG GGTTCCATAA
 
Protein sequence
MAFHSDQTLP TMRVARRIIA GLLLLAVVIG LARVGRFVHA QPAQSASLHA SDWQAISELL 
PPNQQAYLKA SNTNLGDLFS SSVAIDGNTI AIGAPNESSS ATGINGNQQD NSVISSGAVY
IFVRTGTTWS QQAYIKASNP DFNDLFGHSV ALSGNTLVVG AVNESSEATG INGNQTDNSA
MNAGAVYVFV RSGTTWSQQS YLKASNAEAF DQFGWIVALD GNTLAVGANL ESSNATGVNG
NQADNNAVRS GAAYIFVRTG TTWSQQAYLK ASNTEANDNF AMALDLSGDR LVVGAVNEDS
AATGINGDQS NNDAASAGAA YVFVRSGTTW SQQAYLKASN TEANDFFGES VTIDDSTVAV
GAWWEDSSAT GVNGDQNNNN TTFSGAAYVY SFDGMSWSQQ AYIKASNTDT ENYFGHALVL
RGDRLIVSAY ADDSAAIGIN GDQQNADAGG SGAAFVFARV GTVWSQQHYL KASNTGVEDT
FGYTMATDGL SLVVGARYED SNATGIDGNQ ADNSADLSGA AYVFSLAQSV AYLPLVFKRM
TTLIATINPN TIPIRPITVQ GETFLSSSFI LPSDLPATGT YYLSASPTSV MPSLVDDAVV
LSANSTQIFR HEYSTPNSAI VTVPYATLAP YAGQSITVQF NDVYGSVVQA SPMYLIWVP