Gene Haur_1447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1447 
Symbol 
ID5733311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1680909 
End bp1682207 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content51% 
IMG OID641278585 
Productmajor facilitator transporter 
Protein accessionYP_001544219 
Protein GI159897972 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.674302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAC TTGATACAAA TAATGACGCG CAGGCCTCAA CTAAGCTTGC GCGTTCATTA 
GGTTGGACAT TGGTGATTAA ACTGCTGATC TCGCTGCATG TTGGGGTTGG CGATTTTTTG
TTGCCCTTGT ACGTTCAAGC GCTGGGTGGT TCGCCCCAGG TGATTGGCAA TTTGGTGGGT
TTGGGCGCGG CCTTTGGCGT ATTTGCGCGG CTCAGCCTGA GTTGGCTTGC CGATCGTGGG
ATCATCGGGA TTTTGCTGCG TTTGAGTTTG GCGGCCTTAG CGAGTGGCTT TGCAATTTGG
AGCTTTGCTG ATAGCAGTGT TTGGTTAATG CCTGGTCAAG CATTGCTCGC GATCGGGCGA
GCAGGCTCGA CTATCTGTTT AAGTTTGTTG ATTGCCCAAT TAACCAGCCA AGGTCAACGG
GGTTCAGGCT ATGGGCGGCT GACCATGGCC AGTTCATTGG CAACAATTGT TGGGGCGATC
ATCGCGGGCA TTGGCTTTAT TGGCTGGGAT GCAGAAACTC GCCAGCAATT GCAACAATTC
ACTTGGCTGC AAACTAGCCT CAATTATTTA CCCAATCCAA TTCCCCGTGT CGAGCTATTT
CATGGCATTT ACATCGTGTT TAGTAGTAGC GTCGTCGTAG CAGGAATCTT CAGTTTACGT
TCATGGCCCC ATGCAATTCC GGCAAAACGC GGTACAATAC AAGCAATATG GCGCAGTGTG
TTACGTCAGC CAACAATCCG CAGTCTGTTG CTTGTGCAAA GCTGCATCAG CGCGGGCTAT
AGTGCCTCGA TTCCAATGAC CGTGCCGTTA TTGACTGATC GTTTTGGGGC AAGTGTGGCG
GCGGTCGCTG TGGCCTATAT TGTGCCAGGC ATTATTTATG CGCTGTTTCC AGCACGCCTT
GGGCGCGTCG CTGATCGGAT TGGCTATCGA CGGGCTGCCA AGCTTGGGCT TGGCGTAAGT
ATGTTAGTTT ACCTAGCAAT ACCTATCAGC CCACAACTTG CAATCACCGC AGCATTTTGG
GCTTTTGAAG CGCTGGCATG GAGTTTTTAT GTACCAGCTT TGGAAGCCTT ACTTGCGGAA
AGTGTGATAC CACAACAACG CGGCACAGCC TTGGCGATCT ATGGAGCGTC AGGCGCATTG
ACCGCAACTG TGGCAGCACC CTTGGGCGCA CGCTTGTATA GCCATTGGAT CGCCGCACCC
TTTTTATTCT CGGCCCTTTG CCTAGGTATG GCAGCCATGT TTGCTGCCCG TACCCCACCA
ACCAATGCCG ATTATGCTAC CATTAAATCT CACAATTGA
 
Protein sequence
MNKLDTNNDA QASTKLARSL GWTLVIKLLI SLHVGVGDFL LPLYVQALGG SPQVIGNLVG 
LGAAFGVFAR LSLSWLADRG IIGILLRLSL AALASGFAIW SFADSSVWLM PGQALLAIGR
AGSTICLSLL IAQLTSQGQR GSGYGRLTMA SSLATIVGAI IAGIGFIGWD AETRQQLQQF
TWLQTSLNYL PNPIPRVELF HGIYIVFSSS VVVAGIFSLR SWPHAIPAKR GTIQAIWRSV
LRQPTIRSLL LVQSCISAGY SASIPMTVPL LTDRFGASVA AVAVAYIVPG IIYALFPARL
GRVADRIGYR RAAKLGLGVS MLVYLAIPIS PQLAITAAFW AFEALAWSFY VPALEALLAE
SVIPQQRGTA LAIYGASGAL TATVAAPLGA RLYSHWIAAP FLFSALCLGM AAMFAARTPP
TNADYATIKS HN