Gene Haur_1191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1191 
Symbol 
ID5733084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1369675 
End bp1371558 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content48% 
IMG OID641278331 
Productamino acid permease-associated region 
Protein accessionYP_001543967 
Protein GI159897720 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000300657 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGACCC AATTGAAGGG GGTTTTAGTG GGGAATCCGT TGGAGACTGC GGCCCAATCG 
CATGAGCGCT TGGATAAAAA AACCGCGCTC GCAGTCTTCT CGTCTGATGC TTTATCATCA
GTCGCCTATG CCACCGAAGA AATGCTTGTC CACCTTGTGC CAGCCGGCAT CATTGCGTTT
AGCTCATCGC TGTGGCTGGG CATCGGCATC GCTGTTTTGT TGATGATCGT GACGATCTCG
TATCGCCAAA CAATCACGGC CTATCCCAGC GGTGGTGGCT CGTACATCGT GGCCTCGGAT
AATTTAGGCA CTCTTCCGGG TTTGATCGCT GGCGGTGCAT TATTAATTGA CTATATTCTG
ACTGTGGCAG TGTCGATCTC GTCGGGCGTT TCGCAGTTGA TCTCATTGGT CGAGCCGTTG
CGTGATTATC GAATTGAAAT TTGTTTGATT GGGATTCTCA TTCTGACCTT GGCCAATTTG
CGTGGGATTC GCGAATCGGG GGCGATTTTC TCGCTGCCTA CCTACTTTTT TATTACAGTC
ATTATGTTGA CCCTCGGCTA TGGCTTTTAC AAACAATTTA CAGGCGATAT TCAGCCGTTA
GTGCTTTCGG ATAACCTGAT GGGGCCACAT GAACAAAGCT TCTCGCCATT TGGAACTGAG
GCTATGACCG CCTTTTTGCT GATGGGCGCG TTTGCCTCGG GCTGTTCAGC ATTAACCGGG
GTTGAGGCAA TTTCGAATGG CGTACCAGCA TTTCGCAAGC CAGAGCCACA CAACGCTCGC
GTCACCATGG TTTGGATGGC GGGATTACTA TTAGTGATGT TCGCTGGCAT TACCTGGTTT
GCTCACAAGT ATGGCGCACG CCCGCAATTC AACGAAACCG TGATTTCGCA AATTGGGCGA
GGCATTTGGG GGCGCACGAC GGGCAGCGAA ACGGGCTTCC CCAAAGTGAT GCATGGCATG
TTGCAAATCT CAACCGCCGC AATTTTGCTG GTTGCAGCCA ATACCAGCTA CGCCGATTTC
CCACGATTGA TGTCGTTGTT GGCCCGAGAT GGCTTCTTGC CCCGCCAATT CTCATCATTG
GGCGATCGTT TGGTCTTCTC GAATGGGATT CTTTTCTTGG CGGTTGCTGC TGCACTGTTG
GTGATTGGTT TTGATGGCTC GGTTACCAAC TTAATTCCCT TGTATGCGGT TGGCGTGTTT
CTCTCATTCA CACTCTCGCA ATCGGGGATG GTGTTGCGCT GGTTGCGGCT CAAAACCAAG
GGTTGGCAAC TTAATTTAGT GGTGAATGCA GTTGGCGCGA TCGCAACTGG GATCGTTTTG
ATCATCAACG GCACAACGAA ATTCAAAGAA GGTGCATGGT TGGTGGTTAT TTGTATTCCA
ATTCTCGTTT TGATTTTTAC TACGATCAAT CGTCATTACA AAGGCGTAGC CAAACAACTT
TCATTGGAAG GGTTTAGCAA ACCCGTGCCA TTAGAAAATA ATGTGATTGT GCTCGTATCA
TCATTGCATC GTGGCACGGT TAAAGCGCTT GAATATGCCA AATCAATTGC TCCAGGTAAA
GTTCGCGCTT TGTATATTGA ATTTGAGCAT GAACACGAAA AAACTGAACG TCTACAAGAA
CGCTGGCAAC AGTGGGAGCC AGATGTGCCC TTGGATATTG AAATATCTAA ATATCGTTCA
TTGTTACGCC CAGTTTTACG CTATGTTGAT CGGATTGAAG CTGAGCGCAA TGATGATATT
CTAACCATTA TCTTGCCTGA ATTTATTCCG GCGCGAATTT GGGAATATGC CTTACATAAC
CAAACCGCCT TCTTCTTGAA AGGTGCGCTG CTATTCCGAC GCAATAAAAT CGTGATTAGC
GTGCCATATC ATCTTGAACG CTAA
 
Protein sequence
MLTQLKGVLV GNPLETAAQS HERLDKKTAL AVFSSDALSS VAYATEEMLV HLVPAGIIAF 
SSSLWLGIGI AVLLMIVTIS YRQTITAYPS GGGSYIVASD NLGTLPGLIA GGALLIDYIL
TVAVSISSGV SQLISLVEPL RDYRIEICLI GILILTLANL RGIRESGAIF SLPTYFFITV
IMLTLGYGFY KQFTGDIQPL VLSDNLMGPH EQSFSPFGTE AMTAFLLMGA FASGCSALTG
VEAISNGVPA FRKPEPHNAR VTMVWMAGLL LVMFAGITWF AHKYGARPQF NETVISQIGR
GIWGRTTGSE TGFPKVMHGM LQISTAAILL VAANTSYADF PRLMSLLARD GFLPRQFSSL
GDRLVFSNGI LFLAVAAALL VIGFDGSVTN LIPLYAVGVF LSFTLSQSGM VLRWLRLKTK
GWQLNLVVNA VGAIATGIVL IINGTTKFKE GAWLVVICIP ILVLIFTTIN RHYKGVAKQL
SLEGFSKPVP LENNVIVLVS SLHRGTVKAL EYAKSIAPGK VRALYIEFEH EHEKTERLQE
RWQQWEPDVP LDIEISKYRS LLRPVLRYVD RIEAERNDDI LTIILPEFIP ARIWEYALHN
QTAFFLKGAL LFRRNKIVIS VPYHLER