Gene Haur_5024 details


Gene Information

Locus tag: Haur_5024
Symbol:
ID: 5736983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 31668
End bp: 33401
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 59%
IMG OID: 641282191
Product: integrase catalytic region
Protein accession: YP_001547782
Protein GI: 159901536
COG category:
COG ID:
TIGRFAM ID:
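
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 31668
end_bp = 33401

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1734 bp) and Protein Length (577 aa) fields.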


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCAAAC GTGCCGCTGC GCCTGCGTGG CAACAACTCC AAATGCAGAT TACCTGCCCC 
AAACAGTATC TCTATGAACT CATTCGCCCC ATTGTCTTAA CCGAAGAGTC GGTGGCCGAC
CGCGCTAGCG AAACCGCCGT GCCCGATCAC ATCATCACCT ATCATCTTGA CCATTTTCGC
ACCAATGGCT TACCTGGCTT GGTTGCCGCT CCCACGCCCC TCCAACGCGC TGCCCGCTTT
CCCGAAGAAT TGGTCGCCTT TGTGCTGGCG CTCAAAGCGG AGCATCCGCC GCTCACCGCC
CATGAACTGG CTACGATTTG TTTTATCAAA CACGGTCGGC GGCCCAGTAT CAAAACCATT
CGGCGCGTGC TCGCCCATCA CCCGCTCCCG CAGCTCACCA CGCGGCGGTT CCCTCCGTTC
CATGACAATC CTGATCCCAT CCAACGCCGC CACGCCATCC TGGTCTTAAG CCTTGAGGGC
TGGACGAAAA AACGGATTGC GACCTATCTT CAGATTAGCC GTTCCACCGT GTACAACACC
TTTGCACGCT GGCACAAGGA GGGCTTCGCA GGGTTACAAG CAAAGTCCCG TGCGCCGCGC
CGTCGCCATC CCAAAGTCAC CATCGCTATC CAGCAGCGGG TTCGCCGCTT ACAGCGCAAC
CACTTGTTGG GTGCGTGGCG CATGCATGCC GCTCTGCGCC GTGAGGGCAT CCGCCTCAGT
CCGCGCACCT GTGGGCGCAT TATGGCGGTC AATCGCGACC TCTGCCCCGA ATTGCCCAAA
CGGCAGCGAT CACGGAAGCA TGAACCGCGT GCCATGCCCT TTGCTGCCCA GTACCGACAT
CAGTACTGGA CGATTGACAT CCGCTATCTC GATATGCACC GCTTGGGTGG CGGCCATATT
TACTGTATCT CGATTGTCGA AAATTACAGC CGCGCCATTC TTTCCAGCGC GATCAGTCGG
ATTCAGGACA CCACTGCCGT GTTGAAAGTG CTCTATGATG CGGTGGCAAA GTACGGCTGT
CCTGATGGGA TTGTGAGCGA TTCAGGCAGT GTGTTCCGCT CGCATCGGCT CCAGGAGGTG
TGTCAGCACC TGCGGATTCA GCAGTGCCCG ATTGAGAAGC GCCAACCCTG GCAATCCTAC
ATCGAGACGA CCTTCGGTAT CCAACGGCGT ATGGCTGACG AAGCAGAGGA GGGGTTTCGG
GCCGCGCAGA GTTGGGATGC GCTCTGGCAC GCGCATCGCA CATGGCTACT CCACTACAAC
ACCGAAGTGC ATTGGGCACA TCGGCAGCGC CAAGATGGGC GAGAAACGCC AGCCGAAGTC
CTGACGTGGA TTCGCGGTCG CCCCTATCCG GAACGGCTGC TCCAGCGCAT TTTTGCCGCA
ACACGGGTGA AACGGCGTTT GGATCGGGTG GGCTTTCTGC GGGTGCGCCG CTGGCGGATC
TACAGTGAAA TTGGCTTGGC CAAGGAAGCC GTTGAGGTAT GGCTGGAGGC GCAGCATGTC
ACGATTACCT ATGCGGATCA CCACCTGCGG TCGTATCCGG CGACGATTGA TACCGATGGG
TGGATACGGA CGATGGAGAC GGGCACGCGC TACGAGCATC CCTTTGGGAG TCGGCAGTTA
ATGCTGTGGG ATGTGATTGA AGACGACTGG GGCAAGACCC AGTATGTCGG CAGTACGCCG
CGCCGAAAGT CCACCATCCC CCTCGCAGAG CAGTTGCGCT TCGCTCTTGG GTAA
 
Protein sequence
MPKRAAAPAW QQLQMQITCP KQYLYELIRP IVLTEESVAD RASETAVPDH IITYHLDHFR 
TNGLPGLVAA PTPLQRAARF PEELVAFVLA LKAEHPPLTA HELATICFIK HGRRPSIKTI
RRVLAHHPLP QLTTRRFPPF HDNPDPIQRR HAILVLSLEG WTKKRIATYL QISRSTVYNT
FARWHKEGFA GLQAKSRAPR RRHPKVTIAI QQRVRRLQRN HLLGAWRMHA ALRREGIRLS
PRTCGRIMAV NRDLCPELPK RQRSRKHEPR AMPFAAQYRH QYWTIDIRYL DMHRLGGGHI
YCISIVENYS RAILSSAISR IQDTTAVLKV LYDAVAKYGC PDGIVSDSGS VFRSHRLQEV
CQHLRIQQCP IEKRQPWQSY IETTFGIQRR MADEAEEGFR AAQSWDALWH AHRTWLLHYN
TEVHWAHRQR QDGRETPAEV LTWIRGRPYP ERLLQRIFAA TRVKRRLDRV GFLRVRRWRI
YSEIGLAKEA VEVWLEAQHV TITYADHHLR SYPATIDTDG WIRTMETGTR YEHPFGSRQL
MLWDVIEDDW GKTQYVGSTP RRKSTIPLAE QLRFALG
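
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only and is not part of the original record. It builds the codon table from the standard NCBI ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in permitted start codons), strips the 10-base spacing used in the listing, and translates the first 30 bases. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCCCAAAC GTGCCGCTGC GCCTGCGTGG"
print(translate(first_blocks))
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows comparison against the listed protein sequence and the GC content field.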