Gene Haur_2823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2823 
Symbol 
ID5734704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3590727 
End bp3591953 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content54% 
IMG OID641279966 
Productmajor facilitator transporter 
Protein accessionYP_001545589 
Protein GI159899342 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.087939 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCGAA GCCCGTTTTA TGGCTGGTGG ATTGTGAGCA TGCTGGGATT TACCGAGATG 
ACTTCGTGGG GCGTGATTTA CTACGCATTT AGCGTGCTCT TGACCCCGAT GCAGCGTGAG
TTGGGTTGGT CGCAAGCCCA TTTTACGGGT GGATTTTCGC TGGCCTTGTT TATTTCGGGA
ATTGTGGCTT TGCCCGTTGG CCGCTGGCTC GACCAGCATG GCGCACGCGG TTTGATGACA
CTTGGCTCAT GTTTGGCGGC AATTTTGGTG GTGGCATGGG CCAATGTTCA ATCGCTCTTG
GCTTGGTATC TGATTTGGGC TGGCTTGGGT TTGGCGATGG CGGCAATTTT ATATGAGCCA
GCGTTTGCCG TGGTGGCAAC GTGGTTTCAG CAAAAACGCC AACATGCCCT GACAATTTTG
ACGGTTGGCG GTGGCTTGGC CAGCGTCGTA TATGTGCCCT TAGTTACGCG ATTGCTCGGC
ACACTGAATT GGCGCGAGGT GTTGCTGTGG CTGGCAGCGA TTTTGGCAGT GCTAACGATT
CCGTTGCATG GCTTGGTATT GCGTGGTAAG CCCGCCGATT TGGGCTTATT GCCTGATGGC
GGATCGCTGG CAGTGGCAAC CGTTACGCCA AATCCCACGA TTCAGCCTTC GATGTCGTTG
GGAAATGCCA TTCGAGCAAG CTCATTTTGG TGGTTGGCGC TGGCTTTTGG CCTGACCACG
ATGGCAACAT TCACCTTAGG CGTGCATTTG ATTTCAGCGA TTCAAGCCCA AGGCTATGCC
CCAGAGATTC AAGCCTTGGC TGTGGCCTTG CTGGGTGGTT CGCAAATTCC CAGCCGAATT
GTGATTGGAA GTGTTGGGCG ACGTTGGCCC CAAGTCCAAT TAGCATGGAT GTTGTGTTTG
CTGCAAAGTG CTGCGTTTGC CATCTTCCTG TTTGTGCCCA ATGTGACAGG GCTACTGCTG
TTTGCTTGCT TGTTTGGGGC GGGATCTGGC GCGTTGACCC CGACACGAGC AGCATTGGTT
GCCGATGTTT TTGGCACAGC CCAATATGCC AGTATCAGCG GGGCGTTGGC ATTGTTGACC
ACAACAGCTC GGGCCTTGGC TCCAGTTTTG GCTAGCCTGT TGGTGGGATT GTTGCATAGC
TATCAACCGC TGTTTGGCTT GCTGTTGCTG ATGTGTTTGA TCAGCGCAGC GGCAATTTAT
TTGATTCGAG GTGCAAGTAA TGGCTAA
 
Protein sequence
MRRSPFYGWW IVSMLGFTEM TSWGVIYYAF SVLLTPMQRE LGWSQAHFTG GFSLALFISG 
IVALPVGRWL DQHGARGLMT LGSCLAAILV VAWANVQSLL AWYLIWAGLG LAMAAILYEP
AFAVVATWFQ QKRQHALTIL TVGGGLASVV YVPLVTRLLG TLNWREVLLW LAAILAVLTI
PLHGLVLRGK PADLGLLPDG GSLAVATVTP NPTIQPSMSL GNAIRASSFW WLALAFGLTT
MATFTLGVHL ISAIQAQGYA PEIQALAVAL LGGSQIPSRI VIGSVGRRWP QVQLAWMLCL
LQSAAFAIFL FVPNVTGLLL FACLFGAGSG ALTPTRAALV ADVFGTAQYA SISGALALLT
TTARALAPVL ASLLVGLLHS YQPLFGLLLL MCLISAAAIY LIRGASNG