Gene Haur_3557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3557 
Symbol 
ID5735416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4473017 
End bp4474930 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content54% 
IMG OID641280704 
Producthypothetical protein 
Protein accessionYP_001546321 
Protein GI159900074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC TTCCACTGCG ACGATTGATT TTGTACAAAC ATGGCGTTGG CTATTTCGAG 
CGGCGCGGTC AGTTCGAGGG CACGAAACTG GCGTTGGCCT TCAGTCGCGA AGCCATGGAC
GATGTGCTGA AAAGTTTGGT GGTGTTGGAA AAAGGCGAGG GTCATGTCAA GGGCATCGAT
TACGAACGAC CTGAATCATG GAGCAGCCGA CGAATTGAGC TTGGCGCTGG CCGCCCACTG
CGCGATCTCT TAACCGAATT ACGCGGGCGA CGAATTCGCC TAGCCTTGCG CGATGAAGCT
GCTGCCGAAG GCTTGATCGT CGGCATCGAC GAGCCACCAC CAGAAGAGCC AATTTCGCAA
TCGCTGGTGA CAATTTATCG CAGCGATGTG CGCCAAATCA CCTTGCATCG TTTAAATGAT
ATTCGCGGCG TGCAATTGCT TGATTCGGCT GCCGACGAAG TGCTGTGGTC GTTACGCGCC
AACAGCGATA AACAAGATTC GCGCACAGCC ACGATTCAGC TCGACGAGGG CAAGCACGAT
CTTTTGGTGG CCTACCTTGC GCCAGCCCCC AGTTGGAGGG TTTCCTATCG GATTATTGTC
GATAATATCG AAACCGAATT GCCCGATGTG CTACTCCAAG GCTGGGGTTT ATTCGATAAT
GTGCTTGATG AAGATTTAGA AGATGTGCGG ATTTCGTTGG TTGCAGGCCG TCCAGTTTCG
TTCCGCTACC CGCTCTACGA GCCGCAACAA CCGGAACGCC CGTTGATTGA AGACACAATT
CGGCCTGCCG CACCACCAGC CTATACCTTG GCTGCTCCGG CTCCTGCTAG CGCACCACGA
CCAAAAATGA TGCGAGCCAT GGCTATGCGT GAAGAAGCAT TTGCGGGAGC TGATCTTGAT
AGCCTGAGCA TGGATGAAAT GGCATCAATT GCACCCGTTG CCGAAGGTGC TGCCCAAGGT
GCGTTATTCC GCTACGATGT GCGCGAACCA GTCAGCGTTG GCCGTGGTCG TTCAGCTTTA
GTACCATTGC TCAATTTACG CACCGCCTGT CGGCGCGAGT TGGTCTATCG CGGCGCGGCG
GGCGAGACCC ATCCCATGGT CACGGTACGC TTTGAAAATA GCAGCGGCCT AACCTTAGAG
CGCGGCCCAA TTACAATTAT GGAAAGTCGC TCATATGGCG GTGAAGCAGT CTTGAACTGG
ACAGCCGAAG GGGCAATGGT CACAATTCGC TATGCCCAAG CCTTGGAAGT TTCAGTTAAA
GAAAATCAAA ACTCGGCGCA TCAGACCCGG CGTATCCGGC TTGGTCGCGA TGTGTTAATT
CACGAAGTTG AAGAATCGCT AACCACGATT TACACCGCAA CGAATACTGC TGGCGAAGCC
CGCGTGATCA AAATTGAGCA TCCTTTGCGT CACCCCTACG AGCTGTTCGA TACCGTCCAA
CCAGCCGAAG CCAATAGCCA ATTGGCCAGT TGGTTATTGC CAATTCCAGC GCGAGGCGAA
GCGGCCTTGC GCGTGCGCGA ACGACGCTTG GTTGAACGGC GCGAACACAT TCGCTCGATC
AATTTCGAGC AATTGCGCCG CTATTTGATG GATCGCATGC TCGATCAAAA TACGGTCAGC
GAGTTGCGCG AAGTGCTCAC GCTGTATGCA CGAATTGATA GCATCGTTCA ACGCTACAAC
GAAATTGAAG CGTTACGTCA GAAGATCTAC AATCAACAAA CTCAAATTCG TGGCAACCTC
GATGTACTCA AAAACGAAGG CGGCGAGGGT GAACTCCGCA CCCGCTATAT CACCACCCTA
GCCGAAACCG AGGATCAACT CAACGGCCTG CGTGAGGAAG AAGTGACATT GCGCAGTGAA
GAAGCCGCCT GCCACGCCAA ACTCGAAGAG CATCTCAGCC GCTTTCCGGG CTAG
 
Protein sequence
MTELPLRRLI LYKHGVGYFE RRGQFEGTKL ALAFSREAMD DVLKSLVVLE KGEGHVKGID 
YERPESWSSR RIELGAGRPL RDLLTELRGR RIRLALRDEA AAEGLIVGID EPPPEEPISQ
SLVTIYRSDV RQITLHRLND IRGVQLLDSA ADEVLWSLRA NSDKQDSRTA TIQLDEGKHD
LLVAYLAPAP SWRVSYRIIV DNIETELPDV LLQGWGLFDN VLDEDLEDVR ISLVAGRPVS
FRYPLYEPQQ PERPLIEDTI RPAAPPAYTL AAPAPASAPR PKMMRAMAMR EEAFAGADLD
SLSMDEMASI APVAEGAAQG ALFRYDVREP VSVGRGRSAL VPLLNLRTAC RRELVYRGAA
GETHPMVTVR FENSSGLTLE RGPITIMESR SYGGEAVLNW TAEGAMVTIR YAQALEVSVK
ENQNSAHQTR RIRLGRDVLI HEVEESLTTI YTATNTAGEA RVIKIEHPLR HPYELFDTVQ
PAEANSQLAS WLLPIPARGE AALRVRERRL VERREHIRSI NFEQLRRYLM DRMLDQNTVS
ELREVLTLYA RIDSIVQRYN EIEALRQKIY NQQTQIRGNL DVLKNEGGEG ELRTRYITTL
AETEDQLNGL REEEVTLRSE EAACHAKLEE HLSRFPG