Gene Information |
Locus tag | Haur_2727 |
Symbol | |
ID | 5734608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3482884 |
End bp | 3484833 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279870 |
Product | ABC transporter related |
Protein accession | YP_001545493 |
Protein GI | 159899246 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
|
|
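The coordinate, length, and protein fields above are related by simple arithmetic, and a short check makes that explicit. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3482884
end_bp = 3484833
gene_length_bp = 1950
protein_length_aa = 649

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
coding_codons = codons - 1

print(span, remainder, coding_codons)  # 1950 0 649
```

Under these assumptions the three fields agree: 1950 bp is 650 codons, one of which is the stop, leaving 649 residues.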
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.26231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTCT TAATTGCTAA TAATCTCGGC AAATGGTTTG GAGCCGAACA AATATTTGAA GCTGTCTCGT TCCAAGTGGC TCGTGGCGAC AAAATCGCCT TGGTTGGGGT CAATGGCGCG GGCAAATCTA CGTTAATGAA AATTATCGCT GGCATCGATA GCTCCAGCGA AGGTTCGTTG CATCGCTCAC GCGGTTTGCG CGTGACCTAC CAAGCTCAAG AAGCAACGTT CGCCGCCGAT TCGACCTTAG AGCGCGAAGC GCATGCCGCC TTTGCTGCGC TGAGCAACAT CGAAGACGAA ATGCGCCAGC TCGAAGTTAC AATCGCCAAT CCCGATGATC CCCAATGGGA ACAGGCGATG GAGCGCTACG GCGAGTTGCA ACATCGCTAT GAGCATGCTG GCGGCTATGA AAAAGAACAT CGCATTACCC GTACCTTCCA AGGTTTGGGC TTTACTGATG CTCAATGGAC GCAGCCGATT GCTCAGTTTA GTGGTGGTCA ACGCACCCGC GCCGCGCTTG CCGTGGCCCT ACTAGGAGAT CCCGATATTC TATTGCTTGA CGAGCCGACC AACCACTTGG ATATGGCGGC CTTGGAATGG CTCGAAGACT TTTTGCGCGA TTGGGAAGGC ACATTGATCG TGATTTCCCA CGACCGCTAC TTCCTTGATC GGGTTTCAAA TCGCACTTGG GAAATGGAGT GGGGCCGCTT GCAGGATTAC GCCGCGCCCT ATTCCAAATA TCAAACGATC AAGGCTGAAC GCATGGAGCG TTTAGCCAAA GAGTTTGAAG CCCAACAACA GATGATCGCC AAAACCGAGG AATTTATTCG GCGCTTCAAG GCTGGGGTTC GTGCCCGCGA AGCCAAAGGC CGCGAACGCC GACTTAATCG CTTTAAAGAA GGCTGGAATA GTATTCACGG CCATGTTAAA GCGATTGAAG GCCCGCAACG CCGCAAAGAA CTTAAATTTG CCTTGCAAAC CAACCTTCGT TCTGGCGATG TTGTGCTAGC GCTCGATCAA TTGGGAGTTG GCTATACCAA CAACGGACAA ACCACCACTT TGCTACAATT TGATGAATTG TATGTGATGC GCGGCGAACG GGTGGCCTTG CTGGGGCCAA ATGGCAGCGG CAAATCAACC TTGCTCAAAA CCGTGGTCGA TCAACTCAAG CCCTTGGCTG GTAGTTTTGA GGTTGGAGCC AACGTACAGC TTGGCTATTA TGCCCAAGGT CACGAAGGCC TCGATTTCAA CAACACGATT TTGGATGAAG TGCTGCGCCA TAACCCGCAA ATGGGCGAAA CCCGGGCACG TACCATGCTC GGCAACTTCT TATTTACCAG CGATGATGTA TTCAAGCAAA TTCGCGATCT TTCGGGCGGC GAGCGTTCGC GAGTAGCCTT ATCGCAGTTG ATGCTCAATG GTGGCAACTT CTTGATGCTC GACGAGCCAA CCAACCACTT GGATATTCAG GCCCGCGAGG CGCTTGAAGG CGTGCTTAAC GATTTTAATG GTACCTTACT GTTTGTCTCG CACGACCGCT ATTTTATCGA TGCAGTCGCT GATACCTTGT GGTTGGTCAA CGATGATGGC AGCATTACGC GCTTTCCAGG CAATTATTCG GCGCTTGCTG CTCAGCGAGA AAACGAACGT CGTGCTGCTG AAGCCGCCGC GATCGAGGCC AAACGCGCTG CCGAACGCCA AACCAAGGCC AACAAAGCCA ATCCAACGCC TGTGCCAGCC AGTGCCAAGC GCCAATTGCA AAACCTTGAG CGCGAAATTG CCAGCCTAGA GCAACGCAAA 
GCCGCGCTTG ATGCCGAAAT TATGCAAGCA TCAATTAAGC AAGATAGCCG CAAAATTGGC GAGCTTGGCA CGCAATATGC CGCACTCGAA AACCAACTCA GCGATTATTA CACCCGCTGG GAGCAATTGG CCGAAGAAGT TGGAGCCTAA
|
Protein sequence | MSVLIANNLG KWFGAEQIFE AVSFQVARGD KIALVGVNGA GKSTLMKIIA GIDSSSEGSL HRSRGLRVTY QAQEATFAAD STLEREAHAA FAALSNIEDE MRQLEVTIAN PDDPQWEQAM ERYGELQHRY EHAGGYEKEH RITRTFQGLG FTDAQWTQPI AQFSGGQRTR AALAVALLGD PDILLLDEPT NHLDMAALEW LEDFLRDWEG TLIVISHDRY FLDRVSNRTW EMEWGRLQDY AAPYSKYQTI KAERMERLAK EFEAQQQMIA KTEEFIRRFK AGVRAREAKG RERRLNRFKE GWNSIHGHVK AIEGPQRRKE LKFALQTNLR SGDVVLALDQ LGVGYTNNGQ TTTLLQFDEL YVMRGERVAL LGPNGSGKST LLKTVVDQLK PLAGSFEVGA NVQLGYYAQG HEGLDFNNTI LDEVLRHNPQ MGETRARTML GNFLFTSDDV FKQIRDLSGG ERSRVALSQL MLNGGNFLML DEPTNHLDIQ AREALEGVLN DFNGTLLFVS HDRYFIDAVA DTLWLVNDDG SITRFPGNYS ALAAQRENER RAAEAAAIEA KRAAERQTKA NKANPTPVPA SAKRQLQNLE REIASLEQRK AALDAEIMQA SIKQDSRKIG ELGTQYAALE NQLSDYYTRW EQLAEEVGA
|
| |
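The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in which start codons are permitted), and it only translates the first and last 30 bases as copied from the Gene sequence field. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; whitespace is ignored."""
    dna = "".join(dna.split()).upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First and last 30 bases, copied from the Gene sequence field above.
first_30 = "ATGTCTGTCT TAATTGCTAA TAATCTCGGC"
last_30 = "GAGCAATTGG CCGAAGAAGT TGGAGCCTAA"

print(translate(first_30))  # MSVLIANNLG
print(translate(last_30))   # EQLAEEVGA*
```

The two outputs match the start and end of the Protein sequence field, with the trailing `*` corresponding to the TAA stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.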