Gene Haur_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4021 
Symbol 
ID5735882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5131741 
End bp5133042 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content47% 
IMG OID641281171 
Producthypothetical protein 
Protein accessionYP_001546781 
Protein GI159900534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAGTA TGGTTGCTGC TTCGCCGAAT CCTTTGTATG CTGCTTCAGC CTTTGTTGAT 
GTGTATCCAT TTGTGCGTCA GCAAGAGAGC GAAGAAGAGT TATTAATTGG GCGAGTTGAT
ACTAATAATT TTATTATGCT GCCGAAAGAG GCAGTCGAGG TTTTGGATGA TTTGGCGCAG
GGCAAGAGTG TTGGTGAAGC CCAAGCCCTC TATGCCGAAC GCTACGGCGA AATCCCCGAT
CTGGCCGATT TGCTTGAACA ATTAGAATCT GAAGGTTTTG TTCAGCCACT GCATTCCGAT
ACAGTGCGTT TTGGTCAACA ATCCCCCGTA ACTGCTGCTA CTGCTAGCGC CAACCCCAAT
CAACCACGTG CTGTACGCTT TCACTTTACC TTCTTTCCAA TCCGTTTGGC TCAAGTGTTG
TTCAGCCCGA TCTTACTGGT TTGCTATGCG CTCTTTATTG GTGGGGCGGC GGCAATCGTT
GTAGCTCAAC CATCAATTGT GGCCGGCTGG CGAGCCATGG TCGTCGATCA ACAAATGGCG
CTCTTTACCT TGATCATCAT GCTGCATGGG TTTGTGATCA CCTTTTTCCA TGAGCTTGGG
CATGCGGTGG CGGCCCGTTC GCGGGGAGTC GATGTGCGCT TTGGCATTGG GCGACGTTTA
TGGGTGATTG TGGCTGAAAC TGATATGTCA GGCATTTGGT CGATTCAGCG CAACCTGCGA
TTTTTACCAA TTTTTGCCGG CATGATTGTC GATTTGCTCA GTGCAGCAAT TATGGTCTAC
CTCGCATTTA TGCATCAACG CCAAATCATT AATCTTTCTG ATTTTGGCTA TATCTTGGTG
CGGGCGTTTA TGTGGAGCTA CCTGCTGAAT TTGCTGTTTC AATTTTATTT CTTTGTGCGC
ACCGACATCT ACTATGTCCT CTCGACATGG CTTCGTTGCT CAAACTTAAT GGGCGATACA
GCAAATTATA TGATCAATCG TTTCAATCGT TTATTGGGAC GCGCTGAAGT TCATAATCAG
GCAGCTATTC CTGAACGTGA GCGCAAGATT ATCAAACGAT ATGCCTTTTT CTGGCTAATT
GGGCGGATGC TGGCGTTTTA TTCACTCTTC TTCTTAACCC TGCCAATTTT ATGGAGCTAT
GCCAGCATTT TATTTGAACG AATGTTTGGT AGCGCCAGCG CTGGTATGCA GGTGCTCGAT
TCAATTTTGG CAGCCATTTT GATCTTTATT AGCCAAGCTG TTGGAATTTT CCTCTGGCTT
TGGAGCTTGA TCCGCAGAAA GGTTAGCGTC GATGACATCT AA
 
Protein sequence
MTSMVAASPN PLYAASAFVD VYPFVRQQES EEELLIGRVD TNNFIMLPKE AVEVLDDLAQ 
GKSVGEAQAL YAERYGEIPD LADLLEQLES EGFVQPLHSD TVRFGQQSPV TAATASANPN
QPRAVRFHFT FFPIRLAQVL FSPILLVCYA LFIGGAAAIV VAQPSIVAGW RAMVVDQQMA
LFTLIIMLHG FVITFFHELG HAVAARSRGV DVRFGIGRRL WVIVAETDMS GIWSIQRNLR
FLPIFAGMIV DLLSAAIMVY LAFMHQRQII NLSDFGYILV RAFMWSYLLN LLFQFYFFVR
TDIYYVLSTW LRCSNLMGDT ANYMINRFNR LLGRAEVHNQ AAIPERERKI IKRYAFFWLI
GRMLAFYSLF FLTLPILWSY ASILFERMFG SASAGMQVLD SILAAILIFI SQAVGIFLWL
WSLIRRKVSV DDI