Gene Haur_5252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5252 
Symbol 
ID5737210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp22374 
End bp24107 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content59% 
IMG OID641282416 
Productintegrase catalytic region 
Protein accessionYP_001548007 
Protein GI159901762 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.56302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAAAC GTGCCGCTGC GCCTGCGTGG CAACAACTCC AAATGCAGAT TACCTGCCCC 
AAACAGTATC TCTATGAACT CATTCGCCCC ATTGTCTTAA CCGAAGAGTC GGTGGCCGAC
CGCGCTAGCG AAACCGCCGT GCCCGATCAC ATCATCACCT ATCATCTTGA CCATTTTCGC
ACCAATGGCT TACCTGGCTT GGTTGCCGCT CCCACGCCCC TCCAACGCGC TGCCCGCTTT
CCCGAAGAAT TGGTCGCCTT TGTGCTGGCG CTCAAAGCGG AGCATCCGCC GCTCACCGCC
CATGAACTGG CTACGATTTG TTTTATCAAA CACGGTCGGC GGCCCAGTAT CAAAACCATT
CGGCGCGTGC TCGCCCATCA CCCGCTCCCG CAGCTCACCA CGCGGCGGTT CCCTCCGTTC
CATGACAATC CTGATCCCAT CCAACGCCGC CACGCCATCC TGGTCTTAAG CCTTGAGGGC
TGGACGAAAA AACGGATTGC GACCTATCTT CAGATTAGCC GTTCCACCGT GTACAACACC
TTTGCACGCT GGCACAAGGA GGGCTTCGCA GGGTTACAAG CAAAGTCCCG TGCGCCGCGC
CGTCGCCATC CCAAAGTCAC CATCGCTATC CAGCAGCGGG TTCGCCGCTT ACAGCGCAAC
CACTTGTTGG GTGCGTGGCG CATGCATGCC GCTCTGCGCC GTGAGGGCAT CCGCCTCAGT
CCGCGCACCT GTGGGCGCAT TATGGCGGTC AATCGCGACC TCTGCCCCGA ATTGCCCAAA
CGGCAGCGAT CACGGAAGCA TGAACCGCGT GCCATGCCCT TTGCTGCCCA GTACCGACAT
CAGTACTGGA CGATTGACAT CCGCTATCTC GATATGCACC GCTTGGGTGG CGGCCATATT
TACTGTATCT CGATTGTCGA AAATTACAGC CGCGCCATTC TTTCCAGCGC GATCAGTCGG
ATTCAGGACA CCACTGCCGT GTTGAAAGTG CTCTATGATG CGGTGGCAAA GTACGGCTGT
CCTGATGGGA TTGTGAGCGA TTCAGGCAGT GTGTTCCGCT CGCATCGGCT CCAGGAGGTG
TGTCAGCACC TGCGGATTCA GCAGTGCCCG ATTGAGAAGC GCCAACCCTG GCAATCCTAC
ATCGAGACGA CCTTCGGTAT CCAACGGCGT ATGGCTGACG AAGCAGAGGA GGGGTTTCGG
GCCGCGCAGA GTTGGGATGC GCTCTGGCAC GCGCATCGCA CATGGCTACT CCACTACAAC
ACCGAAGTGC ATTGGGCACA TCGGCAGCGC CAAGATGGGC GAGAAACGCC AGCCGAAGTC
CTGACGTGGA TTCGCGGTCG CCCCTATCCG GAACGGCTGC TCCAGCGCAT TTTTGCCGCA
ACACGGGTGA AACGGCGTTT GGATCGGGTG GGCTTTCTGC GGGTGCGCCG CTGGCGGATC
TACAGTGAAA TTGGCTTGGC CAAGGAAGCC GTTGAGGTAT GGCTGGAGGC GCAGCATGTC
ACGATTACCT ATGCGGATCA CCACCTGCGG TCGTATCCGG CGACGATTGA TACCGATGGG
TGGATACGGA CGATGGAGAC GGGCACGCGC TACGAGCATC CCTTTGGGAG TCGGCAGTTA
ATGCTGTGGG ATGTGATTGA AGACGACTGG GGCAAGACCC AGTATGTCGG CAGTACGCCG
CGCCGAAAGT CCACCATCCC CCTCGCAGAG CAGTTGCGCT TCGCTCTTGG GTAA
 
Protein sequence
MPKRAAAPAW QQLQMQITCP KQYLYELIRP IVLTEESVAD RASETAVPDH IITYHLDHFR 
TNGLPGLVAA PTPLQRAARF PEELVAFVLA LKAEHPPLTA HELATICFIK HGRRPSIKTI
RRVLAHHPLP QLTTRRFPPF HDNPDPIQRR HAILVLSLEG WTKKRIATYL QISRSTVYNT
FARWHKEGFA GLQAKSRAPR RRHPKVTIAI QQRVRRLQRN HLLGAWRMHA ALRREGIRLS
PRTCGRIMAV NRDLCPELPK RQRSRKHEPR AMPFAAQYRH QYWTIDIRYL DMHRLGGGHI
YCISIVENYS RAILSSAISR IQDTTAVLKV LYDAVAKYGC PDGIVSDSGS VFRSHRLQEV
CQHLRIQQCP IEKRQPWQSY IETTFGIQRR MADEAEEGFR AAQSWDALWH AHRTWLLHYN
TEVHWAHRQR QDGRETPAEV LTWIRGRPYP ERLLQRIFAA TRVKRRLDRV GFLRVRRWRI
YSEIGLAKEA VEVWLEAQHV TITYADHHLR SYPATIDTDG WIRTMETGTR YEHPFGSRQL
MLWDVIEDDW GKTQYVGSTP RRKSTIPLAE QLRFALG