Gene Hhal_0332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0332 
Symbol 
ID4711284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp374975 
End bp376636 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content70% 
IMG OID639854792 
Productfimbrial assembly family protein 
Protein accessionYP_001001928 
Protein GI121997141 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.715461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGTCTGG CGTGGGAGTA TGGTGTTCTG TCGATATCCG CATTAAACGT CAGGCTTCCG 
CCGCGGATCC CCCCTCCCTT GGCAAGCCTT AACAGGCACT GGGACAAGCA AGGTGACCTT
GTGTTGAACG ACCTCAGGCA GTGGCTGCCC CTGCCCGGAC TCCGCCGTCG GACGCGGGTC
GGGGTGTTCC TCGGCAACGA GCATATAGCA CTGGCAGCGG TCTCCAGCGA TGGCGAGCAA
CTGCTGGCCT GCGACTTCCG GGACGCCCGG CGCGACAACC AGCAGATGGC ACTGCGCGAT
CTGGTCGAGC AGTACGGCCT GGGTGGCTCC GAGGCCGTGG TCGTCCTCGA CGGCACCGAC
TACCAGACCC AGCAGGTCGA TGCCCCGCGG GTCCCCGATG AAGAGCTCTC CGGGGCCGTG
CGGTTCCAGC TCAAGAACCT GCTCTACATC CCGCTGGAGC AGGCCATGGT CGGGGCCCAC
CGACAGCACA GCGATCGCTG GAATCAGGAG GGGCAGCGCG CCCTGGCGAC CATCGCCTCG
CGCACGCGGA TCGAAGGGAT CCAGGAGCTG GTCGCCCGCG CCGGGCTCAA GCTCCAGGCC
GTACTTCCCC GCGAGACGGT CCTCAACGAT CTAAGCGCCG CGGCGACCGA GGGCGCCGGC
GGGATCGTCC TCGCCACCCT CGGACGGGAC GACGGGCTGA TCACCATCAG TCGCGGCGAA
CTCCTCTACC TGGCCCGTAG CCACTCGGTG GGCACGCGGC GGCTGGCTGA GGACGGCCAG
GCCGTCGAGA TCCTCGAGGA TGAGCTGCGT CGCTCGATCG ACTACTTCGA CGGGCAGCTG
TCCACGGGGC CGGCGAGCCG GATACTGCTC GCGCCCTGCG AGGCGAATCG TGAGCCGCTG
ATTGACCGTT TCAACGACAG CTTCGAGATC CCCTGCGCCC GGCTGCGCCT CGAACAGATC
TTCGACCTGG AACCGCTCGG TGATGAGCTC GACGAGCACA CCGAGGCCCA CTGCCTACTG
GCTGTGGGCG CGGCCCTGCC GCGGCCCGCC GAGGCGAGCC TGTCGATGTA CGTTCGTTCG
CGTCGGCAGC TGGAGCCCCT GTCGCCGGCA GCGCTGGGGA GCTATGTGGC CGGCGGGGCG
CTCTTCCTGG GCCTGATCTC GGCGGTGCAC ACGCCGCTGT CGCTCGATCG GGAGGGGCGT
GCCGCGGAGC GCGAGGCGCA GCGGGACGAG CTGCTGGCGT CGGTGGCGGA CCTGGAGGCG
GAGCTCGAGG CGCGGGAGAT CGACCCCAGC CTCCTCGACG AGCGCGAGGC CATTGAGCGG
GACCTCGCCC TGCTGCAGCA GTTCGAAGCC CGGCTGGACA CCCTGGATGA CCGCGCCCTG
GCCGGCTTCT CGGAGCCGCT GCGTGGCCTG TCGCGCCAGC GCGCGGAGGG GGTGTGGCTG
ACCCACATCC GGCTGCGCTC CGGCGCCGGC GTGTTTCAGG GGCGGGCGGT GGCGGCGGAG
GATGTACCTG CCTTCCTCGA CGGCCTGGCC CAAGAGCGCG CCTTCCAGGG GTGGCAGTTC
GAAGAGTTCC ACATCCAGCG CGCCGCGGCT GCGGAGGATA CCGCCGATAG CGTCCGTTTC
CGCGTGGCCA GCCCCGGTCT CGCCGGTGAC GGAGAGGAGT AG
 
Protein sequence
MRLAWEYGVL SISALNVRLP PRIPPPLASL NRHWDKQGDL VLNDLRQWLP LPGLRRRTRV 
GVFLGNEHIA LAAVSSDGEQ LLACDFRDAR RDNQQMALRD LVEQYGLGGS EAVVVLDGTD
YQTQQVDAPR VPDEELSGAV RFQLKNLLYI PLEQAMVGAH RQHSDRWNQE GQRALATIAS
RTRIEGIQEL VARAGLKLQA VLPRETVLND LSAAATEGAG GIVLATLGRD DGLITISRGE
LLYLARSHSV GTRRLAEDGQ AVEILEDELR RSIDYFDGQL STGPASRILL APCEANREPL
IDRFNDSFEI PCARLRLEQI FDLEPLGDEL DEHTEAHCLL AVGAALPRPA EASLSMYVRS
RRQLEPLSPA ALGSYVAGGA LFLGLISAVH TPLSLDREGR AAEREAQRDE LLASVADLEA
ELEAREIDPS LLDEREAIER DLALLQQFEA RLDTLDDRAL AGFSEPLRGL SRQRAEGVWL
THIRLRSGAG VFQGRAVAAE DVPAFLDGLA QERAFQGWQF EEFHIQRAAA AEDTADSVRF
RVASPGLAGD GEE