Gene Haur_4284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4284 
Symbol 
ID5736143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5470500 
End bp5472068 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content52% 
IMG OID641281444 
Producthypothetical protein 
Protein accessionYP_001547044 
Protein GI159900797 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCT ACCCTTGTAA GCTAACCTCT CCCGCACGCG AGCGAGGGGG AATTGATACC 
ATCATCACAG GTGGAATGCC CCTCTCCCGC CGCAGTGGGC GAGGGGTCGG GGGTGAGGGA
AAGCCATTAC ACATTGAGGA ATCTAGGCCC ACTATGAAGC GTCTTTTATC TCAAGCCGGG
CCGGTGGTGA TTGTGGCCCT GTGGCTGGCG CTGCTGCCAA TTAGTTTGTT TCGGCTGTAT
GCCACCGATG AAGTGCAATA CTTTGCCTAT TTGCGCTCGG TCTATTTCGA TGGCAATTTG
GATTTCGCCA ACGAATATGG CTATTTTGCC GATCTCGGCA TGCAAAAGGG CGATCCAGCA
GTCTATAATG CGCTGCTCAA AGATCGTTCA AGCGATCCGC CGCTCAACCC TATCACAGGC
TTATATCGCA ATGTTGCGCC TGTTGGCTCA GCAATCTTGT GGTCGCCGTG GTATGTGGTT
GCCGATGGTT TGGTTGGCGT GGGTATCTTT GGCGATGCGC CACGCGATGG ATTTAGCCAG
CCCTACATCA TTGCCGTGTG CTTAGCATCG GCCTGCTACA CGCTGTTTGG TTTGCTGCTT
TCCTATCGTT TGGCGCGGCG TTGGGTTGGT ATGTGGGCAG CCACATTAGC CACCTTGAGC
ATTTGGCTAG CCTCGCCATT GATTTGGTAC ACCTACATTC AAGTGCCTTG GTCGCATGGC
GCGGGCTTTG CGATGGTGGC CTTGTTTATC ACGATCTGGC TTGGGCCTAC CGATCAACCG
CTGTTAGCCC AAGGTTCGCA GCGTTCATGG GTGCGTTGGC TGGCCTTGGC CATCGTCGGC
GGCTTGATGA CACTGACGAG GGAACAGCTT GGCCTATTTT TGCTCTTGCC AGCAGTTGAA
GGTTTAGTCG CCTATGCTAG CTTAATTCGC CAAGGTCAAT GGCTGCAAGT GCGCCAACTT
TTGGCTAAGC ATGTATTTTT TGTGTTGATA TTTGCCCTGA GCCTTGCTCC ACAGTTGATC
AGCTACAACA TTTTGTATGG CCAGCCCAAG CCGTCAGGCA CGGTTTCGGG CAAATTGAAT
CTGATCAGCT ATAAATTTTT GCATACCTTG TTCGACCCAC GGCGGGGAGC GTTTATGTGG
CATCCGCTGT TGCTGGTCGG CTTAGCTGGC TTGATTTGGC TCTGGCGCAA GGATCGGCTG
CTGACTGGAT TGCTCAGTTT AGGCCTATTT GCCCAAATTT ATCTGAATGG GGCGTTTGGC
TCGACATGGC ATTTGCAAGG CTCGTTCGGC TTTCGGCGCT TGATCGAATG CACGCCAATT
TTTATTATTG GTTTGGCATT ATTGATCGAG CGAATTCGCT GGCCCAAAGC GGCGATTGCC
AGCCTAGCAC TCGTATTCAT TGTTTGGAAT GGCGGCTTAA TTTTTCAAGC GGCGACTGAC
CGCGAGATTC GTGGGCCAGG CTTGCGCTGG AATACCATGC TCGCTGATCA GCTTAAAGTG
CCGCAATTGG TTTGGCAAAA AGCCGATCAA CTGCTGTTTA ATCGCTGCGA AGTCGTTAAA
AATTGCTAA
 
Protein sequence
MQTYPCKLTS PARERGGIDT IITGGMPLSR RSGRGVGGEG KPLHIEESRP TMKRLLSQAG 
PVVIVALWLA LLPISLFRLY ATDEVQYFAY LRSVYFDGNL DFANEYGYFA DLGMQKGDPA
VYNALLKDRS SDPPLNPITG LYRNVAPVGS AILWSPWYVV ADGLVGVGIF GDAPRDGFSQ
PYIIAVCLAS ACYTLFGLLL SYRLARRWVG MWAATLATLS IWLASPLIWY TYIQVPWSHG
AGFAMVALFI TIWLGPTDQP LLAQGSQRSW VRWLALAIVG GLMTLTREQL GLFLLLPAVE
GLVAYASLIR QGQWLQVRQL LAKHVFFVLI FALSLAPQLI SYNILYGQPK PSGTVSGKLN
LISYKFLHTL FDPRRGAFMW HPLLLVGLAG LIWLWRKDRL LTGLLSLGLF AQIYLNGAFG
STWHLQGSFG FRRLIECTPI FIIGLALLIE RIRWPKAAIA SLALVFIVWN GGLIFQAATD
REIRGPGLRW NTMLADQLKV PQLVWQKADQ LLFNRCEVVK NC