Gene Haur_4799 details


Gene Information

Locus tag: Haur_4799
Symbol:
ID: 5736644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6116933
End bp: 6118150
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 52%
IMG OID: 641281965
Product: major facilitator transporter
Protein accession: YP_001547558
Protein GI: 159901311
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
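
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(6116933, 6118150)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1218 405
```

Under these assumptions the computed values agree with the gene length and protein length listed in the record.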


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.896954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCAGG TTCCTTCGAC GCATGTGCGT CATCCAATGT GGCCAATTTA TCTCTGCGTT 
GCGGTTACTG CCCTTGGAAT TGGGATCATC AACCCGTTGA TTCCGCATTT ACTTGAGCAA
AACGGCGCTA ATGAATTTAT TGTGGGCTTG AGCACCAGTG TGATGTTTGC CAGCCTTGTG
CTAACTGGCA TGCCGATTGG CCGCACAATC GATAAATTTG GCATTCGACT ATTTTTGATC
GTTGGCTTAA TTTCCTACAC CGTTGCTATG CTGGCAATGC CTTGGACGCA TAGTATCCCG
CTCTTTTTCA TGTGGCGGAT TTTCGAGGGG ATTGGTTGGT CGTGTGTTTG GACTGCCGCT
GAAACCTATG TCAGTCGGGT TAGTTCGCCT GCACAACGAG GGCATAACAC CGCCGTCTAT
GGCATGTCGC TGGCCAGTGG TACAGCAGCA GGTCCAATCA TCGGCACAGC GGTTTATGAA
TGGACCAAAA ACCCTGTCCA TCCATTTTTG GTAGCAGTCG CGGCAGCGAT TATCGCGACC
GTGATTGTGG TCTTGGTTGT GCCAGAACCG CATGTTCATC ATACAGAGCA GGAAAGTGGT
GGTGTCAGCT TTAAAATCAC CCGCCCATTA ATTTTGCCTT TGGCGATTGC CTTCCTGTAT
GGTTATGGCA CGTTGTCGCT GGTTTCATTG CTACCAACCC TCAATTATAG CAACTGGGAG
CTGGGTACGC TGATTTCGGT AACGGTCATT GCCAACATTG TGGCCCAAGT GCCAATTGGC
CGAATGCTTG ATCGCTATGG CTATCGGCCA TTACTAATTG GCTCGTTATC GCTCCTTTCG
GCGGCAGCCT TGTTCTCGAC CCTGCACCCA CCGTTCTTGC TAACCCTATT TCTCGGGATG
TTGCTGGGCG CATTCGCTGG CACGCTCTAT CCAATTGGGT TGGCTGTGTT AGCAGCGCGG
GTTCCACCAG CCAAATTGGG CGGTGCTAGC GGCATGTTTA CGGTTTGCTA TGGGCTTGGC
AGTTTTGTTG GGCCTGCGCT CACAGGCGGC GTAATGTCCA TGGTTGGTGC GAAACATAGC
GACCAAGCCT TGTTTGGCAC GATTGGCTGT TTGGTGCTCG CACTCTTGGG CTTAATGATG
ACCGGAATCG ATCGGGTCAA CGTCAAAGAG GAACATCCAG TTAGCGTAGC CTCAAGCGGC
ATCAAGCCAC CAATGTAA
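
The gene sequence is displayed in groups of ten bases, so it has to be stripped of whitespace before any computation. The following is a minimal sketch of that cleanup together with a GC percentage calculation. The helper names are hypothetical, and the database may compute its GC content figure differently (for example with different rounding).

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# Final line of the gene sequence as displayed in this record.
last_line = "ATCAAGCCAC CAATGTAA"
fragment = clean_sequence(last_line)
print(len(fragment))  # 18
```

Passing the full sequence block through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field of the record.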
 
Protein sequence
MQQVPSTHVR HPMWPIYLCV AVTALGIGII NPLIPHLLEQ NGANEFIVGL STSVMFASLV 
LTGMPIGRTI DKFGIRLFLI VGLISYTVAM LAMPWTHSIP LFFMWRIFEG IGWSCVWTAA
ETYVSRVSSP AQRGHNTAVY GMSLASGTAA GPIIGTAVYE WTKNPVHPFL VAVAAAIIAT
VIVVLVVPEP HVHHTEQESG GVSFKITRPL ILPLAIAFLY GYGTLSLVSL LPTLNYSNWE
LGTLISVTVI ANIVAQVPIG RMLDRYGYRP LLIGSLSLLS AAALFSTLHP PFLLTLFLGM
LLGAFAGTLY PIGLAVLAAR VPPAKLGGAS GMFTVCYGLG SFVGPALTGG VMSMVGAKHS
DQALFGTIGC LVLALLGLMM TGIDRVNVKE EHPVSVASSG IKPPM
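
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares; table 11 differs from the standard code only in the start codons it permits. Alternative start codons are not handled here, which is an assumption that holds for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, matching the amino acid string below.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a nucleotide block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()  # drop the grouping whitespace
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final line of the gene sequence shown in this record.
print(translate("ATGCAGCAGG TTCCTTCGAC GCATGTGCGT"))  # MQQVPSTHVR
print(translate("ATCAAGCCAC CAATGTAA"))  # IKPPM
```

The two fragments translate to the first ten and last five residues of the protein sequence listed in the record.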