Gene Haur_5300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5300 
Symbol 
ID5737258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp94916 
End bp96163 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content55% 
IMG OID641282464 
Producthypothetical protein 
Protein accessionYP_001548055 
Protein GI159901810 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.130766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAACC CGCACGTATC GACGATTGAC ACCGCAACGA CCCCTCGCCG TGATCCGTTG 
ACGGTCTACC AATATATGCT TACAGGCCAA CCACTCAAGG ACAACAATGC CCAATTGCCC
CAAACGGTTG AGGTCAAAGG CCAGCCGATG ACGGCGATTG GGATTGATCC GGGCAATGGC
GAGATGAAAG CGGCGATGAT GGGCCTTGAT GGGCGGCTTG TCACGGTGCA AATTATTTCG
GCCTACCGCA TCGCTGTGAC CCTTGGCGGT GGCAAAAGCC CCACGACCTA TACCGTGAAT
GGTGGCCCCT CGTTTTGGAT TGGCCGTGAT GCCGTGCAAA TGAAGGGCGA TGCCTTACCG
ATTGGCCCAA CGGCGGTGCG TTTAGAAGAT CCCCGCCAGA TTGACTTCTA TGCTGCGGGT
GTCGTTGAAC TCTTGATCAA AGCACACTGC GCTCCTGGGC AATACACCCT CGCAACGGGC
TTAGCCTTGC CCAATATGGA GATGCAAGCC CAGGTCAAGA AGAATGAAGC GGGAGAAGAG
GTGGAAGTCT TTGGGGTGGT CGAGGAGAGC AAGCAGGCGA TCAAGGAGCA TATCTACGGC
AAGAGCTACC ATGTGAGCCG TCTTGACGAA GATGGGGACG TGACCAATTG GCAGATCACC
TTTGGCCAAG TCTATACCCA AGCCCAGAGC TATGGCACCT TTATGGCTCT CACGCACACC
ATCTTTGGAA CCCGCCGCAC CGATGGCATT CAAGAGTATG CGATTATTGA CATGGGCCGT
GGCGACACCC ACGAAACCCT CATCCAGTTA TCACCCACGT TCCGCATGAT GACGAAGCGC
ACCGGCGAAG GCACGATCAA GCAAGCACGG GCGGTTGCAC GCGCCTTGGC GGAGTTTGAC
TTGAATGATG CCCAAGCCCA AGAAGCCTTG ATCACGCGGA GTATTCTTGA TGGGGGACGG
CCCAAATCGA TTAATCATGT CGTCGATAAA GTAGTAGAGC GCGAAACCCA AGAAATGCTC
AGTCGCTTGT TACCCGCATT GAGAAATAGA AATGCCTTCA TTGCCTTTAC GGGTGGCGGC
ACCAAGGACG CGACAACCTT GCAAATGATT AATGATCGGA TGGACAGCGT GGGCCGCAGC
GCCGAGAGTT TTGTGATTGT GCATCCCGAA GTCGCCAGCG TCTTGAACGC GGTGGGAACC
TTGTTGAAAG TCTTGTTTAC CGAGTTAGCA CGAAAGGGAC GGGCATAA
 
Protein sequence
MTNPHVSTID TATTPRRDPL TVYQYMLTGQ PLKDNNAQLP QTVEVKGQPM TAIGIDPGNG 
EMKAAMMGLD GRLVTVQIIS AYRIAVTLGG GKSPTTYTVN GGPSFWIGRD AVQMKGDALP
IGPTAVRLED PRQIDFYAAG VVELLIKAHC APGQYTLATG LALPNMEMQA QVKKNEAGEE
VEVFGVVEES KQAIKEHIYG KSYHVSRLDE DGDVTNWQIT FGQVYTQAQS YGTFMALTHT
IFGTRRTDGI QEYAIIDMGR GDTHETLIQL SPTFRMMTKR TGEGTIKQAR AVARALAEFD
LNDAQAQEAL ITRSILDGGR PKSINHVVDK VVERETQEML SRLLPALRNR NAFIAFTGGG
TKDATTLQMI NDRMDSVGRS AESFVIVHPE VASVLNAVGT LLKVLFTELA RKGRA