Gene Information |
Locus tag | Hhal_0332 |
Symbol | |
ID | 4711284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 374975 |
End bp | 376636 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639854792 |
Product | fimbrial assembly family protein |
Protein accession | YP_001001928 |
Protein GI | 121997141 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
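The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention; the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 374975
END_BP = 376636
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 553


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks agree with the record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the fields are mutually consistent: 376636 - 374975 + 1 = 1662 bp, and 1662 / 3 - 1 = 553 codons.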
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.715461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGTCTGG CGTGGGAGTA TGGTGTTCTG TCGATATCCG CATTAAACGT CAGGCTTCCG CCGCGGATCC CCCCTCCCTT GGCAAGCCTT AACAGGCACT GGGACAAGCA AGGTGACCTT GTGTTGAACG ACCTCAGGCA GTGGCTGCCC CTGCCCGGAC TCCGCCGTCG GACGCGGGTC GGGGTGTTCC TCGGCAACGA GCATATAGCA CTGGCAGCGG TCTCCAGCGA TGGCGAGCAA CTGCTGGCCT GCGACTTCCG GGACGCCCGG CGCGACAACC AGCAGATGGC ACTGCGCGAT CTGGTCGAGC AGTACGGCCT GGGTGGCTCC GAGGCCGTGG TCGTCCTCGA CGGCACCGAC TACCAGACCC AGCAGGTCGA TGCCCCGCGG GTCCCCGATG AAGAGCTCTC CGGGGCCGTG CGGTTCCAGC TCAAGAACCT GCTCTACATC CCGCTGGAGC AGGCCATGGT CGGGGCCCAC CGACAGCACA GCGATCGCTG GAATCAGGAG GGGCAGCGCG CCCTGGCGAC CATCGCCTCG CGCACGCGGA TCGAAGGGAT CCAGGAGCTG GTCGCCCGCG CCGGGCTCAA GCTCCAGGCC GTACTTCCCC GCGAGACGGT CCTCAACGAT CTAAGCGCCG CGGCGACCGA GGGCGCCGGC GGGATCGTCC TCGCCACCCT CGGACGGGAC GACGGGCTGA TCACCATCAG TCGCGGCGAA CTCCTCTACC TGGCCCGTAG CCACTCGGTG GGCACGCGGC GGCTGGCTGA GGACGGCCAG GCCGTCGAGA TCCTCGAGGA TGAGCTGCGT CGCTCGATCG ACTACTTCGA CGGGCAGCTG TCCACGGGGC CGGCGAGCCG GATACTGCTC GCGCCCTGCG AGGCGAATCG TGAGCCGCTG ATTGACCGTT TCAACGACAG CTTCGAGATC CCCTGCGCCC GGCTGCGCCT CGAACAGATC TTCGACCTGG AACCGCTCGG TGATGAGCTC GACGAGCACA CCGAGGCCCA CTGCCTACTG GCTGTGGGCG CGGCCCTGCC GCGGCCCGCC GAGGCGAGCC TGTCGATGTA CGTTCGTTCG CGTCGGCAGC TGGAGCCCCT GTCGCCGGCA GCGCTGGGGA GCTATGTGGC CGGCGGGGCG CTCTTCCTGG GCCTGATCTC GGCGGTGCAC ACGCCGCTGT CGCTCGATCG GGAGGGGCGT GCCGCGGAGC GCGAGGCGCA GCGGGACGAG CTGCTGGCGT CGGTGGCGGA CCTGGAGGCG GAGCTCGAGG CGCGGGAGAT CGACCCCAGC CTCCTCGACG AGCGCGAGGC CATTGAGCGG GACCTCGCCC TGCTGCAGCA GTTCGAAGCC CGGCTGGACA CCCTGGATGA CCGCGCCCTG GCCGGCTTCT CGGAGCCGCT GCGTGGCCTG TCGCGCCAGC GCGCGGAGGG GGTGTGGCTG ACCCACATCC GGCTGCGCTC CGGCGCCGGC GTGTTTCAGG GGCGGGCGGT GGCGGCGGAG GATGTACCTG CCTTCCTCGA CGGCCTGGCC CAAGAGCGCG CCTTCCAGGG GTGGCAGTTC GAAGAGTTCC ACATCCAGCG CGCCGCGGCT GCGGAGGATA CCGCCGATAG CGTCCGTTTC CGCGTGGCCA GCCCCGGTCT CGCCGGTGAC GGAGAGGAGT AG
|
Protein sequence | MRLAWEYGVL SISALNVRLP PRIPPPLASL NRHWDKQGDL VLNDLRQWLP LPGLRRRTRV GVFLGNEHIA LAAVSSDGEQ LLACDFRDAR RDNQQMALRD LVEQYGLGGS EAVVVLDGTD YQTQQVDAPR VPDEELSGAV RFQLKNLLYI PLEQAMVGAH RQHSDRWNQE GQRALATIAS RTRIEGIQEL VARAGLKLQA VLPRETVLND LSAAATEGAG GIVLATLGRD DGLITISRGE LLYLARSHSV GTRRLAEDGQ AVEILEDELR RSIDYFDGQL STGPASRILL APCEANREPL IDRFNDSFEI PCARLRLEQI FDLEPLGDEL DEHTEAHCLL AVGAALPRPA EASLSMYVRS RRQLEPLSPA ALGSYVAGGA LFLGLISAVH TPLSLDREGR AAEREAQRDE LLASVADLEA ELEAREIDPS LLDEREAIER DLALLQQFEA RLDTLDDRAL AGFSEPLRGL SRQRAEGVWL THIRLRSGAG VFQGRAVAAE DVPAFLDGLA QERAFQGWQF EEFHIQRAAA AEDTADSVRF RVASPGLAGD GEE
|
| |
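The relationship between the gene sequence and the protein sequence above can be sketched in code. The gene sequence as listed begins with TTG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. Translation table 11 shares its amino-acid assignments with the standard code and permits TTG as an initiator, which is read as methionine. The helper below is a hypothetical illustration, not the database's own pipeline; it assumes the input is a complete CDS in coding orientation and forces the first codon to M.

```python
from itertools import product

# Standard codon order (T, C, A, G at each position) and the matching
# amino acids. Table 11 uses the same assignments; it differs from the
# standard code in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS given in coding orientation.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored. The first codon is reported as M on the assumption that it
    is a valid initiator; a trailing stop codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    protein[0] = "M"  # assumption: initiator codon (here TTG) encodes Met
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 21 bases of the gene sequence above, with a TAG stop appended
# purely for illustration.
print(translate_cds("TTGCGTCTGG CGTGGGAGTA TTAG"))  # MRLAWEY
```

The output matches the first seven residues of the protein sequence (MRLAWEY). Passing the full gene sequence to the same function should reproduce the listed protein, provided the assumptions above hold.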