Gene Haur_4246 details

Gene Information

Locus tag: Haur_4246
Symbol:
ID: 5736100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5417242
End bp: 5418489
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 51%
IMG OID: 641281401
Product: major facilitator transporter
Protein accession: YP_001547006
Protein GI: 159900759
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
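
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 5417242
END_BP = 5418489
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5418489 - 5417242 + 1 gives 1248 bp, and 1248 / 3 - 1 gives 415 aa, matching the table.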


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000179551
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTCAA CATCGACTAC CCCGCGCCAA CCCTCGCCGT TGGTCGCTTT ACGCCACCGC 
GATTATCGGC TGCTCTGGAG CGGCCAACTG ATTTCAATTG CTGGCTCGCA GATGCATACC
GTAGCGTTGC ACGTTCAAGT TTATCGTTTA GCTAGCGCGA TTCCTGGGGC TAATCCAGCG
ATTTTTCTGG GCTTGATTGG CTTGTTTCAG TTTATTCCCT TGCTGTTGCT GGCGTTACGT
GCAGGTCTAT TGGCCGATCG AGTTGATCGA CGACGTTTGA TGCTGGTAAC ACAAAGTATC
TTGATGGGGT TATCGTTGGT ATTGGCGGTT TTATCGTGGT TTGGCTTAAT CAATCTGTGG
TTGCTGTATG GAATTATGAT TATCTTTTTC AGCACCAAAA CCTTTGATTT ACCAGCTCGC
CAAGCGTTAA TTCCGCGTTT AGTGCCGCGT GAAGTACTGC CAACAGCATT AAGTTTAAAT
ATGATTGCTT GGCAAATTGG CAATATTGCT GGGCCGGCCT TGGGTGGTTG GTTTGTTAGC
TATTCAATTG CCTTGGTCTA TTTGATCGAT GCGATCAGTT ATGGCGTGGT GGTATTGAAT
TTGTGGCAAA TGCGCGGCAA TTATGCCCCA ACCGAGGTTA AACCAATAAT CAAAGGCTCC
ATGTGGGAAG GTTTGCACTT TGTACGGCGC ACGCCGATTA TTTGGTCTAC CATGGTGCTC
GATTTTATTG CCACCTTTTG TGGTGCTGCC ACGACCCTCT TGCCCTTATT CGCTGATAAA
GTCTTAAAGG TTGATGAAAA AGCCCTCGGT TTGATGTATG CAGCACCAGC AATTGGCGCG
TTAGTAGCCG CCCTCGCCAT GTCGTGGTTT GGCAATCCGC GCCGCCAGGG CATGGTTGTG
GTGGTTTCGG TGGTGCTCTA TGGCTTGGCG ACCATGGTGT TTGGGCTAGC TCCAAGCTTA
CCAATTGCCG TGCTGGGCTT GGCGGGCACA GGTGCAGCTG ATACGGTCAG TGCTGTCTTG
CGCGGCACAA TTCGCCAATT AAACACCCCC GACGAGCTGC GTGGGAGAGC AACCTCGGCC
AATATGCTGT TTTTTCAAGG CGGGCCATTG CTAGGCGAGG TTGAAGCTGG CTTCGCTGCA
TCATTGGTTG GTGCGCCAAT CGCTATCGCT TTTGGTGGCG CGATTTGTGT CGCCGCCGCA
ATCATCATTG CTGTGCGGAT ACCCAGTTTA CGCTTGTACG ATCGTTGA
 
Protein sequence
MASTSTTPRQ PSPLVALRHR DYRLLWSGQL ISIAGSQMHT VALHVQVYRL ASAIPGANPA 
IFLGLIGLFQ FIPLLLLALR AGLLADRVDR RRLMLVTQSI LMGLSLVLAV LSWFGLINLW
LLYGIMIIFF STKTFDLPAR QALIPRLVPR EVLPTALSLN MIAWQIGNIA GPALGGWFVS
YSIALVYLID AISYGVVVLN LWQMRGNYAP TEVKPIIKGS MWEGLHFVRR TPIIWSTMVL
DFIATFCGAA TTLLPLFADK VLKVDEKALG LMYAAPAIGA LVAALAMSWF GNPRRQGMVV
VVSVVLYGLA TMVFGLAPSL PIAVLGLAGT GAADTVSAVL RGTIRQLNTP DELRGRATSA
NMLFFQGGPL LGEVEAGFAA SLVGAPIAIA FGGAICVAAA IIIAVRIPSL RLYDR
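
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be joined and how the GC fraction and the stop codon could be checked. The helper names are hypothetical, and the stop codons listed (TAA, TAG, TGA) are those of translation table 11.

```python
STOP_CODONS = ("TAA", "TAG", "TGA")  # stop codons in translation table 11


def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence are used here;
    # paste the full Gene sequence text to analyse the whole gene.
    fragment = join_blocks("ATGGCTTCAA CATCGACTAC")
    print(len(fragment), gc_fraction(fragment))
```

Running `gc_fraction` on the full joined gene sequence gives a value that can be compared with the GC content reported in the Gene Information table.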