Gene Haur_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2054 
Symbol 
ID5733942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2562982 
End bp2564682 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content58% 
IMG OID641279196 
Producthypothetical protein 
Protein accessionYP_001544823 
Protein GI159898576 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0120012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGC GTGCGGCGTT GTTTGGTGAC TATCCCATTC CTGAGGACAC CGTTGAATTG 
GCACACGCCA TTGCTCCACA TGGCAACCGA CTCATGCACC TCCGTGATCA CTTTGGCATG
CTGTTTGACA ATCAGCAATT CAGCACGCTC TTTTCCCATA CTGGTCAACC GGCCCTCGCG
CCAGCACGAC TCGCCATGGT CACCATCCTC CAGTTCATGG AAGATCTCCC CGATCGCCAA
GCCGCCGATG CCGTGCGGAT GCGCATTGAT TGGAAATATG TCTTGGGCCT TCCGCTTGCT
GATCGTGGCT TTGATGCCTC CGTCCTCAGC GAGTTCCGCG CCCGGCTTGT GGCGGGAGAT
GGTGCGTCTA TCCTCTTTGA AACCCTGCTG GAGCGCCTTC GCGATTACGG ATTACTGCGA
ACACGCGGGC AACAACGCAC CGATTCGACC CATGTCCTCG CGGCAGTTCG TGGCCTGAGT
CGCATTGAAT GTCTTGGCGA AACGATGCGT GCCACCCTCA ATGCGCTCGC AACCGTGGCT
CCCGCATGGG TCCGCTGCCA GATTCCACCG CCGTGGTTTG ATCGCTATGG GCCACGTGCT
GATGCATATC GCTTTCCGAA GGCAGCCGCC GACCGTCAAC GTCTTGCCGA GCAGATTGGA
GCCGATGGGT TTGAACTCCT CACCCTCCTT GCGGCACCGA CTGCCCCGCG TGAACTGCAC
GTTCATCCCG CCGTATGCAT CCTTCGTCGC GTTTGGTGGC AACAATACCA TGCCCCCAAT
GGACCAGTTC GCTGGCGTGA AGTGGCTGAT ATGCCACCGA GCAGGATGCG CATTCATTCG
CCCTATGATC GCGATGCGCA GTACAGCACG AAACGCAATA TGGAATGGAC GGGATACAAA
GTGCATCTGA CTGAAACCTT CGATGCTGAT CTGCCCTGCC TCATCACGCA TGTGCTTACC
ACGCCATCCA CGATTCGTGA TGGTGAAGTC TTAGACCAGA TTCATGAGGG CCTTGCTCGC
CATGATCTCC TTCCCAGCAC CCATCTTGTT GATACGGGCT ATACCGATGC AGCAGCGATG
CTCACCAGCC AGTCCACCTA TGGGATTACA CTGTGCGGGC CAATTGCTCG CGATAGTGCC
TGGCAAGCGA AAGACCCGAC CGCCTTCGAT ATCACGCGAT TTCAGGTCGA TTGGGACGCG
AAGGTGGTCA TTTGTCCCCA AGGACACGCA AGTACCAAAT GGATTTCGCA TCAGGATCGA
CACGGGAATC CTGCCATTCG CGTGACGTTT CGACCGCGTG ACTGCCGAGC GTGTCCAGTC
CGAACACAGT GTACGCATAC GGCAACCGCA GCACGAGGCC TTTCGCTCCG CCCCCGAGAA
CAGCATGAAG TCCTTCAGCA GCGGCGGCAC GCCCAAACAA CCGATGCCTT CAAACGGCAG
TATGCAAAAC GGGCGGGAGT CGAGGGACTA ATGTCGCAAG CAACCCGAGT CTGTGGGATG
CGGCAGAGTC GCTATGGTGG GATGGCGAAA ACGCGACTCC AGCATGTGCT GACCGCGTGT
GCGCTGAATC TGCTGAGGAG TGTGGCATGG GTGACCGGTG GGTCGCGTCA CCAAACCCAA
ACGTCGCGCT TTGTGGCCCT CCGTCCACCG CCTGCTCTGT CTCAGATACG TGAACAGACA
CGGCTCCATA GCCACCAATG A
 
Protein sequence
MTMRAALFGD YPIPEDTVEL AHAIAPHGNR LMHLRDHFGM LFDNQQFSTL FSHTGQPALA 
PARLAMVTIL QFMEDLPDRQ AADAVRMRID WKYVLGLPLA DRGFDASVLS EFRARLVAGD
GASILFETLL ERLRDYGLLR TRGQQRTDST HVLAAVRGLS RIECLGETMR ATLNALATVA
PAWVRCQIPP PWFDRYGPRA DAYRFPKAAA DRQRLAEQIG ADGFELLTLL AAPTAPRELH
VHPAVCILRR VWWQQYHAPN GPVRWREVAD MPPSRMRIHS PYDRDAQYST KRNMEWTGYK
VHLTETFDAD LPCLITHVLT TPSTIRDGEV LDQIHEGLAR HDLLPSTHLV DTGYTDAAAM
LTSQSTYGIT LCGPIARDSA WQAKDPTAFD ITRFQVDWDA KVVICPQGHA STKWISHQDR
HGNPAIRVTF RPRDCRACPV RTQCTHTATA ARGLSLRPRE QHEVLQQRRH AQTTDAFKRQ
YAKRAGVEGL MSQATRVCGM RQSRYGGMAK TRLQHVLTAC ALNLLRSVAW VTGGSRHQTQ
TSRFVALRPP PALSQIREQT RLHSHQ