Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2403 |
Symbol | |
ID | 5734284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3061786 |
End bp | 3063765 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279544 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001545171 |
Protein GI | 159898924 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000436928 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTTC ATTCTGATCA AACGCTACCG ACTATGCGAG TTGCTCGCCG AATTATTGCT GGATTATTGT TGTTGGCTGT GGTTATTGGG TTGGCGCGAG TTGGGCGGTT TGTCCATGCC CAACCAGCCC AATCAGCGAG CCTCCACGCA AGCGATTGGC AAGCTATTAG CGAACTGTTA CCCCCCAACC AACAAGCCTA TCTCAAGGCC TCGAATACCA ATCTTGGCGA TCTGTTTAGC TCAAGCGTGG CGATCGATGG CAATACAATT GCGATTGGTG CGCCCAATGA ATCAAGTTCG GCAACTGGAA TTAATGGCAA TCAACAGGAT AATAGTGTCA TTAGTTCAGG TGCAGTTTAT ATTTTTGTGC GTACTGGGAC CACCTGGAGC CAGCAAGCCT ATATTAAAGC CTCAAATCCC GATTTCAATG ATCTATTCGG TCATAGCGTG GCATTGTCTG GTAATACCTT GGTGGTTGGG GCGGTCAATG AATCAAGTGA GGCCACCGGA ATTAATGGCA ACCAAACCGA TAACAGCGCG ATGAATGCTG GGGCCGTTTA TGTGTTTGTG CGCAGTGGCA CGACTTGGAG CCAGCAATCC TATCTCAAAG CTTCAAATGC TGAAGCCTTT GATCAATTTG GCTGGATTGT CGCGCTTGAT GGCAATACCT TAGCGGTTGG GGCTAATCTT GAATCGAGCA ATGCGACTGG AGTTAATGGC AACCAAGCCG ATAATAATGC TGTCCGTTCA GGAGCAGCCT ATATATTTGT GCGCACTGGC ACGACCTGGA GCCAACAAGC CTATCTCAAA GCCTCAAATA CCGAGGCCAA CGATAATTTT GCGATGGCAC TTGACCTAAG CGGCGATCGC TTGGTGGTTG GGGCGGTCAA TGAAGATAGT GCTGCCACTG GCATTAATGG TGATCAATCC AATAATGATG CAGCTAGTGC TGGCGCAGCC TATGTTTTTG TGCGCAGTGG CACGACCTGG AGTCAGCAAG CTTACCTCAA AGCCTCAAAT ACCGAGGCCA ACGATTTCTT CGGCGAGAGC GTGACGATCG ATGACTCAAC CGTAGCAGTT GGCGCATGGT GGGAAGATAG TTCGGCTACG GGCGTTAATG GTGACCAAAA TAATAATAAT ACGACCTTTT CAGGAGCAGC CTATGTCTAT AGCTTTGATG GTATGAGTTG GAGCCAGCAA GCCTATATTA AAGCCTCAAA TACTGATACT GAAAATTATT TTGGTCATGC ATTGGTGTTG CGTGGCGATC GGCTGATTGT TAGCGCCTAT GCTGACGATA GTGCGGCCAT TGGCATCAAT GGTGATCAAC AGAATGCTGA TGCTGGCGGT TCTGGAGCGG CTTTTGTCTT TGCACGAGTT GGTACGGTGT GGAGCCAGCA ACACTATTTG AAAGCCTCGA ATACTGGAGT TGAAGATACT TTTGGCTATA CCATGGCAAC CGATGGTTTA AGTTTGGTGG TTGGTGCAAG GTATGAAGAT AGCAATGCAA CCGGAATAGA TGGCAACCAA GCTGATAATA GCGCCGATTT GTCTGGTGCG GCCTATGTGT TTAGCCTAGC ACAATCGGTT GCCTATTTAC CATTGGTCTT TAAACGGATG ACGACCCTGA TTGCTACGAT TAATCCTAAT ACGATTCCGA TTCGCCCGAT CACTGTTCAA GGCGAAACCT TTTTGAGCTC AAGCTTTATA TTGCCCAGTG ATTTGCCTGC AACTGGTACG TATTATCTTT CGGCGAGTCC GACGAGCGTT ATGCCGAGCT TAGTTGATGA TGCGGTGGTA TTGTCTGCCA ACAGCACCCA GATTTTTCGC CATGAATATT CAACCCCTAA TTCAGCGATT GTAACTGTGC CCTATGCGAC CCTCGCTCCG TATGCTGGCC AATCAATCAC TGTTCAATTT AATGATGTTT ATGGCAGCGT GGTTCAGGCC TCGCCAATGT ATCTGATTTG GGTTCCATAA
|
Protein sequence | MAFHSDQTLP TMRVARRIIA GLLLLAVVIG LARVGRFVHA QPAQSASLHA SDWQAISELL PPNQQAYLKA SNTNLGDLFS SSVAIDGNTI AIGAPNESSS ATGINGNQQD NSVISSGAVY IFVRTGTTWS QQAYIKASNP DFNDLFGHSV ALSGNTLVVG AVNESSEATG INGNQTDNSA MNAGAVYVFV RSGTTWSQQS YLKASNAEAF DQFGWIVALD GNTLAVGANL ESSNATGVNG NQADNNAVRS GAAYIFVRTG TTWSQQAYLK ASNTEANDNF AMALDLSGDR LVVGAVNEDS AATGINGDQS NNDAASAGAA YVFVRSGTTW SQQAYLKASN TEANDFFGES VTIDDSTVAV GAWWEDSSAT GVNGDQNNNN TTFSGAAYVY SFDGMSWSQQ AYIKASNTDT ENYFGHALVL RGDRLIVSAY ADDSAAIGIN GDQQNADAGG SGAAFVFARV GTVWSQQHYL KASNTGVEDT FGYTMATDGL SLVVGARYED SNATGIDGNQ ADNSADLSGA AYVFSLAQSV AYLPLVFKRM TTLIATINPN TIPIRPITVQ GETFLSSSFI LPSDLPATGT YYLSASPTSV MPSLVDDAVV LSANSTQIFR HEYSTPNSAI VTVPYATLAP YAGQSITVQF NDVYGSVVQA SPMYLIWVP
|
| |