Gene Haur_2801 details


Gene Information

Locus tag: Haur_2801
Symbol:
ID: 5734682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3560973
End bp: 3562181
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 51%
IMG OID: 641279944
Product: hypothetical protein
Protein accession: YP_001545567
Protein GI: 159899320
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
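
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3560973
end_bp = 3562181

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1209 402
```

Under these assumptions the computed values match the listed Gene Length (1209 bp) and Protein Length (402 aa).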


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.99719
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAGC TTGCACGTGT TACCGCAACC ATCCTGTTTA CGCTGGCGCT TGTGGTATTA 
ATGTGGATGT TCAGCGATGC GGTGTTGCTT TTCTTAGTTT CGCTGGTGAT TGCCTCGGCT
GTGCGGCCCT TTGCTCAGCG CTTGATTGAT CGGGGCATGA GCCGTGGAGC GGCCTATGGC
TTGGTCTATT TTGCCAGTTT TGTGTTTTTT GGAGGCTTGT TTGCGGTGAT CAGCCGTGGT
TTGCTGAGCG AATTACAGCT AGCCGGCGAC CAATTATTGA ATTTTTACAC TCGCGTCACG
CTAACCTGGC CCCAAGGCCA AGAATGGCAA CAAACGATAG CTAGCGAACT GCCCTCGCGT
CAAGCATTGC TCGATTATTT TGCAGGAACT GATGGGACAG CAGTAATCAA TAGCATCGTG
GGGTTTGGCA GCGGCGTATT CGATTTTCTA GCACAGTTGG TGATTTTGCT CTCACTGAGC
ACCTATTGGG CAGTTGATCA AAACCGCTTT GAGCGTTTAT GGCTTTCGTT GATCAGTGCC
AAACATCGCC AACGCGCCCG CACCATTTGG CGGGCAGTGG AGTTAGAAGT TGGTGCCTAT
ATTCGCAGCG AAGTTGCCCA AAGCTCGTTG GCAGGCATAA GTTTAGGGAT TATTTTTGAA
TTGATTGGCT TGCCCTATCC AACGCTTTTG GCCTTGTGGA TTGCCATTGC TTGGCTGGTG
CCATGGGTTG GGGTATTGTT GGCCTTGATT CCAGCGGTGG CCGTAGGCTT ATTGGTCAAC
ATTCCTGTGG CCTTATTGGC AGGTTTATCG ACAATCGCCG TGTTTGCCTT TTTGCAAATT
TGGGTCGAGC CACGCTTGTA TGGGGCCAAT CGGGTTAGTC CAGTTTTGGT GGTGCTGATA
TTAATGGTGA TGGCGCAAGC CTTTGGGATT TTGGGCATGC TGATTGCCCC ACCCTTGGCA
GCCACCATTC AAATTTTGGG GCGCGAACTG TGGTTTAGCA AATCGCCAGA GGAGCAATTG
GCTGATGATT ATCGCAATCA GTTGCATACC CTACGCCAAC GGCTGGCGAC GATTGAGCGC
GACGAGGTGA CAGCAGAAGT CGCAGCGACC GCCCAACATC AATCGTATAT TCGCCAAATT
GAGGCGCTAA TGGATCAGAC TCAAGCTGCG TTGGGCAATG CTGGCATGCT CAATCGTGAG
CAATCGTAG
 
Protein sequence
MKQLARVTAT ILFTLALVVL MWMFSDAVLL FLVSLVIASA VRPFAQRLID RGMSRGAAYG 
LVYFASFVFF GGLFAVISRG LLSELQLAGD QLLNFYTRVT LTWPQGQEWQ QTIASELPSR
QALLDYFAGT DGTAVINSIV GFGSGVFDFL AQLVILLSLS TYWAVDQNRF ERLWLSLISA
KHRQRARTIW RAVELEVGAY IRSEVAQSSL AGISLGIIFE LIGLPYPTLL ALWIAIAWLV
PWVGVLLALI PAVAVGLLVN IPVALLAGLS TIAVFAFLQI WVEPRLYGAN RVSPVLVVLI
LMVMAQAFGI LGMLIAPPLA ATIQILGREL WFSKSPEEQL ADDYRNQLHT LRQRLATIER
DEVTAEVAAT AQHQSYIRQI EALMDQTQAA LGNAGMLNRE QS
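
The gene and protein sequences can be checked against each other by translating the gene sequence codon by codon. The following is a rough sketch in Python, not the method used to produce this record. It assumes that translation table 11 uses the standard amino-acid assignments for internal codons (the tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used as sample input.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments for
# internal codons and differs only in the permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Sample input copied from the start and end of the gene sequence.
first_30 = "ATGAAACAGC TTGCACGTGT TACCGCAACC"
last_9 = "CAATCGTAG"

print(translate(first_30))  # MKQLARVTAT
print(translate(last_9))    # QS
```

The two outputs match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be compared in the same way.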