Gene Haur_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2086 
Symbol 
ID5733974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2597208 
End bp2598986 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content50% 
IMG OID641279227 
ProductABC transporter related 
Protein accessionYP_001544854 
Protein GI159898607 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCTT GGCAATTAAT GCTGCGTTTG ATCCGCTATC GGCCAGCAAT CTATGGCTTT 
GGGGTCTTGA TTTGGAGCAT ATTTTTTATC TCGCCCTTGC TGCCTGGCCT GATCAATCGG
GCAATTTTCG ACCAACTCAG CGGCGCGGCT CCAGCGGTAT TTGGCATCTG GACGCTCTTA
GCCCTCTTGA TCACCACTGA GATTAGCCGA CGTTTATTGA CCCAACTTGG AACCGCGATG
GATATCTCGG TGCAATATAA CATTGCAGGC TTGTTGCGCA AAAATATGCT GAAGCGGATT
TTGGCTCAGC CAGGTGCCCA AGCCTTGTAT ACCTCATCAA GTGCGGCGAT CAACAATTTT
CGCGATGATG TTGAGGAAAT TCTGACCTTT AGCATTTGGC TGCTTGATGT ATTTGGTAAT
GTGCTGTTCA GCCTGATTGC GCTAGGAATC CTGTTACAAA TCAATCAACA GGTAACCTTG
GTTATTTTGG TTCCATTGCT GATCGTGGTT TCGGCCTTTA CCGTTGCCAA TCAGCGGATT
CAGCGCTATC GTCAGGCCAG CCGCGCCGCA ACGAGCAAGG TTTCCGATTT TCTGGGCGAA
ATGTTTGGGG CAACCCAAGC GATCAAAGTG GCGAATGCTG AAACTGCGGT AATCGGCCAT
TTTCAACAAC TCAACGACCA ACGTCGCCAG TTTATGGTCA AAGATCGGGC ATTTACGGCC
TTGCTTAGCT CCGTTTACGC CAACACAATT AGCCTTGGCA CGGGCATGAT TTTGTTGCTA
GCAGGCGAGG CAATGCGTAC TGGCAATTTT AGTGTTGGTG ATTTCTCGCT GTTTGTCTAT
TATTTTTCAA TGGTTACCCG TTTGCCCTCA CTGATTGGCT TGTTGTTGAC CCACTACAAA
CAGGCTGGAG TTTCGTTCCA GCGCATGCAA CAGCTATTAA AGGATGCAGC ACCAAGCGCT
TTGGTTGAAC ATGGCTCGAT TACGCCCGAT CAACAGCTGG ATTTTAGCCC AGCCAAGGCC
TTGCCAGCAT TGCAACGGCT TGATATAAGC AACTTAAGCT ATTGCTACCC CAACAGCCAA
GCTGGTATCA AAGCGATTGA TTTCAGGCTT GAGCGTGGCC AATTTGTGGT AGTTACGGGC
AAAGTTGGCT CAGGCAAAAC CACCGTATTG CGGGCCTTGC TTGGTTTAGT GCCAGCCCAT
GGCACGATTC TCTGGAATAA TCAGCCGATC GAACAACCTG CAGAGCAACT CATTCCCCCC
AATTGTGCTT ATACCGCGCA AGTGCCGCGC TTATTGAGCA GCTCGCTGCA CGATAATTTG
AGCCTAGGGC TGAATTTGAG CGCTAATGAA TTACAACAGG CGATTGAAAC GAGTGTATTA
GCGCCAGATC TACAGCATAT GCCCGCAGGC CTAGCGACCG AACTTGGTTC ACGGGGCGTG
CGTTTATCGG GCGGTCAACT ACAACGAGCA GCTTTAGCGC GAATGTTAGT GCGTTCGAGC
GAGTTGATGA TCGTTGATGA TTGTTCGAGT GCCTTAGATG TGACCACTGA GCAACAACTG
TGGCAAGGCC TACGTCAGCA CCCAACCACA TGGCTGGTTG TTTCGCATCG CTGCGCCGTC
CTTCAGCTTG CCGATTGGGT AATTGTTATG GACGCTGGTC AGATTGCAGC CCAAGGCCCA
CTAAATGATC TACTCGAAAC ATCGCCAGCT ATGCGTGAAC TTTGGCAAGC TGAGCCGAGC
AATCAAAAAC TATTGCTTGG TGTTGATGCG CACGTTTAG
 
Protein sequence
MKAWQLMLRL IRYRPAIYGF GVLIWSIFFI SPLLPGLINR AIFDQLSGAA PAVFGIWTLL 
ALLITTEISR RLLTQLGTAM DISVQYNIAG LLRKNMLKRI LAQPGAQALY TSSSAAINNF
RDDVEEILTF SIWLLDVFGN VLFSLIALGI LLQINQQVTL VILVPLLIVV SAFTVANQRI
QRYRQASRAA TSKVSDFLGE MFGATQAIKV ANAETAVIGH FQQLNDQRRQ FMVKDRAFTA
LLSSVYANTI SLGTGMILLL AGEAMRTGNF SVGDFSLFVY YFSMVTRLPS LIGLLLTHYK
QAGVSFQRMQ QLLKDAAPSA LVEHGSITPD QQLDFSPAKA LPALQRLDIS NLSYCYPNSQ
AGIKAIDFRL ERGQFVVVTG KVGSGKTTVL RALLGLVPAH GTILWNNQPI EQPAEQLIPP
NCAYTAQVPR LLSSSLHDNL SLGLNLSANE LQQAIETSVL APDLQHMPAG LATELGSRGV
RLSGGQLQRA ALARMLVRSS ELMIVDDCSS ALDVTTEQQL WQGLRQHPTT WLVVSHRCAV
LQLADWVIVM DAGQIAAQGP LNDLLETSPA MRELWQAEPS NQKLLLGVDA HV