Gene Haur_0356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0356 
Symbol 
ID5732266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp425466 
End bp426575 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content52% 
IMG OID641277479 
Producthypothetical protein 
Protein accessionYP_001543135 
Protein GI159896888 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000129825 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTAA CTCGCCGTTT ATTGCTCAAA AGTGTATTAA TTAGTGTGAT TTGCGGTGTA 
GCGCTGCTGC AATTATATCG TCAGCGTGTG CCACCAGCCT TTAATTTGCC TGCCGCTGCT
AGTGTGCGAA CCCAGCACCC AATCGTTGGC GTTCATACCC GCTTGATGGG CTTGGATGAA
CCAACGATTC GCCGAACCTT GCAGCAGGTA CGCGAAATGG GCGCAACCAC GATTATTGAT
TTGTTTCCGT GGGCCGTGAT TCAGCCACGT TCAGCCAATA GCTACGAGTG GACGGGCAGC
GATATGCTGA TTGCCCATGC CCAACGCCAA GGTCTGACCG TAATCGCTCG TTTGGATTTT
GTGCCAGCTT GGGCACGTCC TGCCAACACC AGTGATCGCT ATCTCGACCC TGATCACTAT
GCGGCCTACG CTGATTTTGT GGTGGCGTTT GCCCAACGCT ATGTGCCGCA AGGCGTGCAG
GTATTGCAAA TTTGGAACGA GCCAAATCTA CGCTTTGAAT GGGGCGATCG TGCGCCTGAT
CCGGTGGCCT ATGCTAATTT GTTGAAAGTT GTCTATCCGC GGGTCAAAGC AGTTGCCCCC
GAAGCGCTCA TTACCTTGGC TGGGCTTGCC CCAGGTGGCC CAACTGGCCT GATCGATCCG
CAAACACTGA GCGTCAATGA TTTGACCTTT CTTAAATTGT GTTTAGCTGA AAAACCGCCC
TTTGATGCAG TTTCGGTGCA TGCCTATGGC TCGATTAATC CCGCTGAGCA AGCGCCAGAT
CTCACAATTA CTAACTTTCG GCGCACTGAA TTAATTTACG ATTTGGTGCT AGCGGCGGGC
TACGAGGTAC CGTTTTATAT CACCGAGGGC GGCTGGAACG ATCACGCCCG TTGGCCAAAC
GCCGTGTCAC CACCAATCCG TGTGCAAAAT ACGCTTGCTG CGTATGCTTG GGCTGAGCAA
CACTGGCCAT GGATGCACAC TGTGGGCTTT TGGCAATTTT CATTACCCAA TCTTACCTAC
ACCTATGCCG ATAACTACAC CTTTGTCGCG CCCGATGGCA CACCCAAGGC GATTTATTAT
GCAGTGCAAG CATTTACCCA ACAACCCTAG
 
Protein sequence
MRLTRRLLLK SVLISVICGV ALLQLYRQRV PPAFNLPAAA SVRTQHPIVG VHTRLMGLDE 
PTIRRTLQQV REMGATTIID LFPWAVIQPR SANSYEWTGS DMLIAHAQRQ GLTVIARLDF
VPAWARPANT SDRYLDPDHY AAYADFVVAF AQRYVPQGVQ VLQIWNEPNL RFEWGDRAPD
PVAYANLLKV VYPRVKAVAP EALITLAGLA PGGPTGLIDP QTLSVNDLTF LKLCLAEKPP
FDAVSVHAYG SINPAEQAPD LTITNFRRTE LIYDLVLAAG YEVPFYITEG GWNDHARWPN
AVSPPIRVQN TLAAYAWAEQ HWPWMHTVGF WQFSLPNLTY TYADNYTFVA PDGTPKAIYY
AVQAFTQQP