Gene Haur_1525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1525 
Symbol 
ID5733412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1778227 
End bp1779297 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID641278665 
ProductPTS system, mannitol-specific IIC subunit 
Protein accessionYP_001544297 
Protein GI159898050 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2213] Phosphotransferase system, mannitol-specific IIBC component 
TIGRFAM ID[TIGR00851] PTS system, mannitol-specific IIC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.003101 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACAGC ATATTCAGCG CTTCGGCAGC TTCTTGGCTG CAATGGTTGT TCCCAATATC 
GGTGCGTTCA TCGCTTGGGG CTTAATCACC GCTCTGTTCA TCCCAACTGG TTGGATTCCG
AACGAAACTT TTGCCAAGCT CGTTGGCCCA ATGATTACTT ACCTTTTGCC CACCTTGATT
GGTTATACTG GCGGTAAGAT GATTCACGAT ACCCGTGGCG GGGTGGTTGG GGCCGCCGCA
ACCATGGGTG TAATCGTCGG AGCCGATATT CCAATGTTTA TCGGGGCCAT GGTCATGGGG
CCACTTGGCG GCTACCTGAT GAAGAAAATT GACGAATTCC TCGAAGATCG CGTGCCAACT
GGCTTTGAAA TGTTGGTCAA TAACTTCTCG GCAGGCTTCC TTGTGATGGG CTTGGTGCTA
TTGGCGAATG CGGCGATTGG CCCTGTTGTC CAAGTGTTAA GTGAGGGCTT AGGCTCTGGC
GTTCAATTCC TGATCGACTC ACGCTTGTTG CCTGTGGCCG ATATTATTAT TGAACCAGCC
AAAGTACTCT TCCTCAATAA CGCCATCAAT CACGGGATTC TTGGCCCCTT AGGCGTGGCT
GAAGTTGGCG AAAATGGTAA AGCGATTCAC TTTTTGCTCG AAAGTAACCC CGGCCCTGGC
TTAGGCTTGC TCTTGGCCTT CTGGTTTGCT GGCAAAGGCA ACCTCAAAGA ATCAGCACCA
GGTGCTGCAG TAATTCACTT CCTCGGTGGG ATTCACGAAA TCTATTTCCC CTACATTTTG
GCTCGTCCTG TGATGATTGT GGCAATGTGG GCTGGCGGGA TCTTAGCCGA CCTGTGGTTT
GTGATTTTTG ATGCAGGTTT GGTTGCTACG CCATCACCTG GCTCGATCTT TGCCTACTTG
GCGGTTATTC CACGTGGCGG TCACTTTGCG GTCTTAGGTG GGGTTTTCGT CGGGATTGTT
GGCTCGTTTG TGGCTGGCTC GATGTTGCTC AAACTCTTCC CGGGCGAAGA AACTGTGGCA
GATGAGCAAA CCAGTTCAGA AATGACTGGC ACCGCTGCAA CCGCCCACTA A
 
Protein sequence
MRQHIQRFGS FLAAMVVPNI GAFIAWGLIT ALFIPTGWIP NETFAKLVGP MITYLLPTLI 
GYTGGKMIHD TRGGVVGAAA TMGVIVGADI PMFIGAMVMG PLGGYLMKKI DEFLEDRVPT
GFEMLVNNFS AGFLVMGLVL LANAAIGPVV QVLSEGLGSG VQFLIDSRLL PVADIIIEPA
KVLFLNNAIN HGILGPLGVA EVGENGKAIH FLLESNPGPG LGLLLAFWFA GKGNLKESAP
GAAVIHFLGG IHEIYFPYIL ARPVMIVAMW AGGILADLWF VIFDAGLVAT PSPGSIFAYL
AVIPRGGHFA VLGGVFVGIV GSFVAGSMLL KLFPGEETVA DEQTSSEMTG TAATAH