Gene Haur_5066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5066 
Symbol 
ID5737024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp79105 
End bp81528 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content62% 
IMG OID641282231 
Producthypothetical protein 
Protein accessionYP_001547822 
Protein GI159901576 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.493331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCGAT GGCGTTGGTT TGGATTGGTT GGACTGCTGC TCGGACTGCT CAGTGGACTG 
CCCGTGGCCG CGATGGACAC GCAGGATGGC TTGCCCACGG TCGCAGTGGG GATTGCCAGT
CCCACAGCAG GGCAGGTGCT GCATGCGCCG AGTACGACCG TGACCGGAAC CGCTACGGCG
GGATCGCCCG TGACGGTGTG GCTCGATGGT CAGGTTGTTG AGACACTGGA GGCGGGCGCG
AATGGCCAGT GGAGTCTGCC TGTCAGTGGC CTGACCCACG GGACGCATAC GCTATCGGCC
ACCAGTACGC TGGATGCGGG CACATGGCTG TACATGCTCG ATCCGTCGTG TGCGGTTGGC
AGGGAGACCT GTGTGCGGAT TGCTGATCCG GATACCGCGA CGATCCACAC GACGATTCAC
ACCAGCCCGG ATTCGGCGGG TGGTGAGCTG CTGATTGCCC ATCCGACCAT GGATCGGCTG
TATATCGGCC ACTCGACGGG GGTGGATGTC TATACCCACG ACGGAACCTT CATCACGCGG
ATTGCGCTGC CCAAGACGGT GCGCATGGGT GCGTTGACCC CGAATGGCCG CGAGTTGTGG
GTTCCGCAAG ATGCGAGTGA TGGGCGGGGT GGCATGGCGG TGATTGATAC CGCGACCAAC
ACGGTGATCA CGATCTTTGA CACGGCGGTG TACGGCAGCG GCTCCACGCT CGTGACCGCC
AGTGCCGCCC AAGATCTGGT CTTTTCACCC GATGGTCAGA CCGCCTACGC GGCGGATATG
GGCGATTACA GCCTGACGGT CTTGGATGTG GCCAGCCGAA CGGTGCGCTC ACGGTTGATT
CGCGAGGGTG ACGCGGCGGT CGGGCGACGG GTGCTGCTCA ATCAGGCTGG GACGCGCTTG
TATTTGGCGA CGCGGCAAGG GAATGCGCTG TATGTCGTCG ATACGGCGAC TAGCAGTTTC
ACCCGCGTGG CCGTGAGCAA CCCCTATCGC CCGCAGCTGG AGGGGATTGT CCTCAGTCCT
GATGAGTCGA AACTCTACGT CGTGGTCTAC CGGATTGGCG ATAGCTTTAC GCCGCAAAAT
CGGGCGTTGA TCTTGGACAC GGCGACCAAT CAGTGGTTGC CCACCGACCT CCGTTGGCCA
CAGCCGAATC CGGTGTGGGG CGCACGGGCG GCAACCCGCC ATCCGGTGAC CGGTATGGTG
TATATCGGCG GCGGCAATGG GGTGATGGTG TTTGATGGTG AGGAACGCCA GCCCGCGCTG
GAGTTGGCAG CGGGCCTCGA TAACTCGGTC TACGAGTTTG ATTGGTTACG ACGCATTGCG
ACCGCGACCG CCAGTGTGAC GGTGCGGGTG GATTTGTCCT CCGATCTTGG CGTGGAGAAA
ACCCATGCGG GGGATTTGGT CGTGGGGCAG GAAGGAACCT ACACCATTGC GGTCACCAAC
CATGGCCCAG CGGTGATGCC TGCCGGAACG ACGATCACGG ACGACGTGCC AGACGATCTG
CGGGTTGTTG CGGCCAGTGG GGCACATTGG TCGTGTGCTA TCACGGGCCA AACCGTGACT
TGTACCGCGA CCGTGGCCAT GCCAGCGCTC GAAACGGGCA CGGTGCAGAT TCGGGTTATT
CCAGAGGCAG CGGCGGGGGC CAGCGTGATT AATCGGGCCT GTGTCGATAC CCTGATTGAT
GCCAACCCGA CCAACGATTG TGATGATGAC CTGACAACCA TCCTGCATCC GGCCTTGGCC
ATCGGGAAGC GCTCGACCCC ACCCAATGGC ACGGCGGTGG CGGCAGGCAA CACGATTACC
TATTTCCTTG ACGTGACCAA TACCGGAACC GCCCCGTTGA CGGGGGTGAC GGTACGCGAT
GCGATTCCCG AGGCAACGGC CTTGATCGCG GCTGATCCGG CAGTGACTCC CATCGACGGC
GTGCTGACGT GGGAACTGGG TGATCTGGCC GTCGGAGCAA CGCGCACCGT GCAGTTTCAG
GTGCGGGTGT TGCCGATCGG CACAACCGTC GCCATTCGCA ATGTGGCGCA GGCCGACAGT
GACCAAACCA GCGAGCAAGA TTCGAACCTG CTGATTCACC CCTTCGACCC GACCAGTATC
AGCCTGGTGT CCTTTGACGC GGTGGCCACG GGCGGCATGG TCGATCTGCG CTGGGTGACG
GGCAGTGAAG TCAACACGTT GGGCTTTCAC CTCTACCGGA GCACCACCCC GAATCGCAAC
GAGGCCACCC GCGTGACGAC CAGCCTGATT CCCTCACAGG GCGCGACGGG CGGCAGCTAC
CGCCTGACCG ATGCCCATGC CACCGCACCG CTTGGCCAAT GGTCGTATTG GCTGGAAGAA
GTCGAGCTGA ACGGCCAAAC CACCTGGTAT GGCCCGGTGA CGGTACGGAT GCATACGATC
TATGGCCCCG CTGTGATGCG GTAG
 
Protein sequence
MHRWRWFGLV GLLLGLLSGL PVAAMDTQDG LPTVAVGIAS PTAGQVLHAP STTVTGTATA 
GSPVTVWLDG QVVETLEAGA NGQWSLPVSG LTHGTHTLSA TSTLDAGTWL YMLDPSCAVG
RETCVRIADP DTATIHTTIH TSPDSAGGEL LIAHPTMDRL YIGHSTGVDV YTHDGTFITR
IALPKTVRMG ALTPNGRELW VPQDASDGRG GMAVIDTATN TVITIFDTAV YGSGSTLVTA
SAAQDLVFSP DGQTAYAADM GDYSLTVLDV ASRTVRSRLI REGDAAVGRR VLLNQAGTRL
YLATRQGNAL YVVDTATSSF TRVAVSNPYR PQLEGIVLSP DESKLYVVVY RIGDSFTPQN
RALILDTATN QWLPTDLRWP QPNPVWGARA ATRHPVTGMV YIGGGNGVMV FDGEERQPAL
ELAAGLDNSV YEFDWLRRIA TATASVTVRV DLSSDLGVEK THAGDLVVGQ EGTYTIAVTN
HGPAVMPAGT TITDDVPDDL RVVAASGAHW SCAITGQTVT CTATVAMPAL ETGTVQIRVI
PEAAAGASVI NRACVDTLID ANPTNDCDDD LTTILHPALA IGKRSTPPNG TAVAAGNTIT
YFLDVTNTGT APLTGVTVRD AIPEATALIA ADPAVTPIDG VLTWELGDLA VGATRTVQFQ
VRVLPIGTTV AIRNVAQADS DQTSEQDSNL LIHPFDPTSI SLVSFDAVAT GGMVDLRWVT
GSEVNTLGFH LYRSTTPNRN EATRVTTSLI PSQGATGGSY RLTDAHATAP LGQWSYWLEE
VELNGQTTWY GPVTVRMHTI YGPAVMR