Gene Haur_4879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4879 
Symbol 
ID5736956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6214452 
End bp6216152 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content49% 
IMG OID641282045 
Producthypothetical protein 
Protein accessionYP_001547637 
Protein GI159901390 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.413638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGTTTA TCCGAAGAGT GTGTTTCGTG TTGGTCTTTT TAGGTCTGGT TGCCAGTGCG 
TGGCTTAAGC CGGCACCAAT TGCTGCCCAA GCCCCATTGA CAGGGGATCA AACAGCCCCC
TTGTTGAACC CAGTGAGCGA GGGCACTCAA CAAGATTTCG AGGCTGGCCT CAATACCGCA
AATGGTGCTG TGCTCTATCA AACGGCATCA TATCCTGGGG TAAGTTTTCA CCGACGTAAT
TCAACGGTAG GCTACGCCTA CTACGGCGGT GGTTGTACCT ATCTTACGGC GCTGAGCAGT
GGGAACGAGG ATAACAACGC CCTTAGCACC CGCCTCGACA TTCCCGACGG TGCGGTCATT
CTAGGGGTTG AATTTCAATA TCGCGATACC GACGCAGTTA ATAATTCACG CTTATATCTC
TATCGTTTTG ATGGTGCTGG TGGCGTTGCT ACAGTCGCAT TATTAAATAG TAGTGGTAAT
GGTGGCTATG GCTCAAGCTA TACTGCAACC AATCTCAATA CCCTCTACGA TGCCTTTAGC
TATTCCTACT CGTTGGTTTG GTACAGCGGC AGCGTTGGCA TCAACCATGC ACTTTGTGGT
GCACGGGTAA AATATGCTTA TAATCCGCCG GTTGCCCGAC CATTCACGAT GCCTCCTGCC
AACCAACCAG ATGCCTTACA AGGTGGTAGC AGCGGCTATA GTTTTACTGC CGCTAGCGAT
TTTGTAGCCT TTGAAGAGTC TGCTGCCTAT AGTTATTCTA GTGCCGGTTG TATGATTCAT
ACTGGTGGAG CCAACCTTGC AGCTGAGGTT GATTTGGCTG ATGGCACGCA ATTGGCTGGC
TATCGGGCAT ATTATTACAA TACTGCCAGC GGCGCAGCGA TGAATGCAAA TCTGGTATGG
ATCGATGGCT CATCAACAAA TACCGTTCTA ATTGCATCTA CCACAGCTAG TAGCGGTTTT
GCGAATGAGT ATTTTGTGCC ACCTTCGGCC CAGATTATTG ATGAGTTCAG TAAAGGCTAT
CTGATGTATA TTCGGCCTGG AACGTCAACG ACGACCAGAA TGTGTGGTAT TCGTACATTC
TATACGACTC CGGTCCAAGC CAAGCAATTG CCAGTTTATA TCAACCCGGT TAGCTCAGTC
ATTGCAGATC ACGATGCCAG TGGCGTGCTA TTAACCGAAC AGCAGCCTGA ATCAGCCCCG
ATCCAGAGCC TAGATCTAGG GACGGTTGAA GAACTAGCAA GCCCGCTGAT TACCAATGAA
TACCAATTTG TCACAGCTCG TTCATTTATG CCACGCGATA ATGTCTATCT TGTGAACAGC
GCCCCAGCTG GGTGTATTTC GTTTGGCTCT GATGCTGAGG TAGACTATAC CTTCCAATTG
CCACCAAACA GTGGGCTACG AGGAATTCGC TTCTACTATC GTAATATTGC TGGCAATTTG
GGCAATGCGC GTTATTGGGC CTTCGATGGA CGTGGCTGGT ATAATCCAAT TTTCACCTAT
GCGATTCCAA CCTCAACGAA CTATGCTTCG CAGTTGGTAA GCTATACTGG TTCGCACTAC
GATCTTTCCA ATGGCAGTGT GGCCCATTCA CTGAATTTCT CGATCCTTAG CCCGAGCAAT
ACGGTTGAGT TTTGTGGTGC TCGGATTTGG TACACCACCG GCCTGCGTTC GCTCTACATT
CCAGTTGCTT TCAAAAATTA A
 
Protein sequence
MWFIRRVCFV LVFLGLVASA WLKPAPIAAQ APLTGDQTAP LLNPVSEGTQ QDFEAGLNTA 
NGAVLYQTAS YPGVSFHRRN STVGYAYYGG GCTYLTALSS GNEDNNALST RLDIPDGAVI
LGVEFQYRDT DAVNNSRLYL YRFDGAGGVA TVALLNSSGN GGYGSSYTAT NLNTLYDAFS
YSYSLVWYSG SVGINHALCG ARVKYAYNPP VARPFTMPPA NQPDALQGGS SGYSFTAASD
FVAFEESAAY SYSSAGCMIH TGGANLAAEV DLADGTQLAG YRAYYYNTAS GAAMNANLVW
IDGSSTNTVL IASTTASSGF ANEYFVPPSA QIIDEFSKGY LMYIRPGTST TTRMCGIRTF
YTTPVQAKQL PVYINPVSSV IADHDASGVL LTEQQPESAP IQSLDLGTVE ELASPLITNE
YQFVTARSFM PRDNVYLVNS APAGCISFGS DAEVDYTFQL PPNSGLRGIR FYYRNIAGNL
GNARYWAFDG RGWYNPIFTY AIPTSTNYAS QLVSYTGSHY DLSNGSVAHS LNFSILSPSN
TVEFCGARIW YTTGLRSLYI PVAFKN