Gene Haur_5253 details

Gene Information

Locus tag: Haur_5253
Symbol:
ID: 5737211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009974
Strand:
Start bp: 24538
End bp: 26157
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 59%
IMG OID: 641282417
Product: hypothetical protein
Protein accession: YP_001548008
Protein GI: 159901763
COG category:
COG ID:
TIGRFAM ID:
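
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record; it assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 24538
end_bp = 26157
reported_gene_length = 1620      # bp
reported_protein_length = 539    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, protein_length)   # prints: 1620 539
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length,
      remainder == 0)                # prints: True True True
```

Under these assumptions the reported gene length (1620 bp) and protein length (539 aa) agree with the coordinates.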


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0754087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATCGA CCCACCCATC TGGTGGGGCC ACCGCCATGC GCACGCCGCA TTGGTTCGCG 
GCCCCCGATA CCGACTATCT CTGGATTCCT TCCTATCTGC TCGAAACCCT GTATGACTCG
CCCACGGCCA TTGGCCTTTT TGCGTTGATT GCGCGGCGCT GGCTCGCCAG TACCACCGCC
ATGGTCGCCT TGAGCGATCA GGACATTCAG CGCTACGACC CAACTATCTG CCGTGGCTCG
ATTCGCGCCG CCATCGATCG CTTGATTGGT GGTGGCTGGG TGCAGGTCGT GCGCCAACGT
GGCCGCAAAA CCCAGTATTG CCCCGCGTGG GGCAAAAGCA CCAATACCCG CGCCTGGTCG
AAAACGGGCA CGCAACTCAA TCGGCGGCGT GTCCGGACGG TGCGGTGTGA TCGCAAACTC
TTTGACGACT ATATGGGGCG GATTATTCCG CATGAGCGGG TTCCTGCCGT GATTGAACGC
TTTGGCGTGC GTGCCCACTT GAGCTTGGCC GATGTGGGGA CGTACCTGGT GATGCAACAT
ACGCCGCATG CGATCAGCGC TACGCCAGCC CTTGAACAGC TCGATTTGTG CTATAACGCG
GAAGCCTTGG CGGTTCCGAC GACCGAGGAA AGCTTGGCCA AAATGGAGTT AAGCGCCTGC
GGAGCGCAGC GGCTGGGGTT GCTCAACGAA CCCCGCCCAC GCCCGATGAA GCAGCCCATG
CTGAGCCAGC ATCTTTTTTT TGTGCCACCG AAGTTGGCTA GGCAGTTGGC TAGGCAGTTG
GCTAGCCAAC ACGCGCTGAA CCAGGCAGGA AATAGCCCAT TGCAATCGGC AAAAACGGCG
GTTGTGAAAG AAAGCACGGT GGTCACAGGC ATGTTAAGCA CTTTAGGCAT TGGAGATCCC
CCTACCCCCA CAATCACAAC GAAACAAGTG CAAAAGAAAA CACTCTGTGG TGGAGAGTTC
TCTTTTCGAG AAAATGGAGA GCGATTAATG ACCAAAAACG ACGAAGGGGA TACGACCAAT
CAACGCTCCA ATCAGCCGCG CCGCCGCCGC AACGTTATGT CAATTCCAGA AACGCCAAGC
GCCAAGCGCC TGCGCGAATT GAACGTGCGG CCCCAAACCT GTGTTGAGTT GGCGGATCTG
CCCGTGGAGT TGATCAACGC CTGTATTGCC GATGGGCAGT CACGGCCTGC GGTCTATGAT
CTGGCAGCGT GGACGGTCTC GATGGCGCGT GATGCGCGGG ATCATGGCTG GCAGGTCGCG
CTCAACAAGC GAGGAGCGCC GCCGACGAAC CAGTGGGATG ATCCTGCGGT GGCGGTTGCT
AAAGCCTTGG CTAGCGGTCT GTTTAACCGC GCGGATGACC TTGAAACCTC GCTGGATGAA
CTCCCATCCA CCCCACCGCA ACGCGGTGGC GATCAGCAGG AGGCCACGGC GACCGATTGC
CCTGCCTGGA TCGCGCCAGC AACGTGGCAA ACGCTGTCGC CTGGACTTCA ACATCTCTTG
GAGCGCTCAC GGTTGCAGGG ACGGCAGGTC GTGGCCTATG ATAGCGGGCG ACAGCGCATG
CTGGCTGATT ACGAGGCGCA GATCGAGCGC TTGGTGATGG CCGCCGTTAT GCGCCGATAA
 
Protein sequence
MPSTHPSGGA TAMRTPHWFA APDTDYLWIP SYLLETLYDS PTAIGLFALI ARRWLASTTA 
MVALSDQDIQ RYDPTICRGS IRAAIDRLIG GGWVQVVRQR GRKTQYCPAW GKSTNTRAWS
KTGTQLNRRR VRTVRCDRKL FDDYMGRIIP HERVPAVIER FGVRAHLSLA DVGTYLVMQH
TPHAISATPA LEQLDLCYNA EALAVPTTEE SLAKMELSAC GAQRLGLLNE PRPRPMKQPM
LSQHLFFVPP KLARQLARQL ASQHALNQAG NSPLQSAKTA VVKESTVVTG MLSTLGIGDP
PTPTITTKQV QKKTLCGGEF SFRENGERLM TKNDEGDTTN QRSNQPRRRR NVMSIPETPS
AKRLRELNVR PQTCVELADL PVELINACIA DGQSRPAVYD LAAWTVSMAR DARDHGWQVA
LNKRGAPPTN QWDDPAVAVA KALASGLFNR ADDLETSLDE LPSTPPQRGG DQQEATATDC
PAWIAPATWQ TLSPGLQHLL ERSRLQGRQV VAYDSGRQRM LADYEAQIER LVMAAVMRR
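
The gene and protein sequences above are printed in blocks of ten characters, so they need to be de-blocked before use. The following is a minimal sketch, not the tool that produced this record: it removes the block spacing, translates DNA with the standard codon assignments (which translation table 11 shares with the standard code for elongation codons), and computes GC content. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first three and last three blocks of the gene sequence are used as a demonstration.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Translation table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0; trailing bases that do not fill a codon are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last three blocks of the gene sequence in this record.
first_blocks = "ATGCCATCGA CCCACCCATC TGGTGGGGCC"
last_blocks = "TTGGTGATGG CCGCCGTTAT GCGCCGATAA"

print(translate(first_blocks))  # prints: MPSTHPSGGA
print(translate(last_blocks))   # prints: LVMAAVMRR*
```

The two translated fragments match the start (MPSTHPSGGA) and end (LVMAAVMRR) of the protein sequence listed above, with the trailing `*` marking the TAA stop codon. Passing the complete gene sequence to `gc_percent` would allow a comparison with the GC content field of the record.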