Gene Information |
Locus tag | Haur_1190 |
Symbol | |
ID | 5733083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1367183 |
End bp | 1369144 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278330 |
Product | amino acid permease-associated region |
Protein accession | YP_001543966 |
Protein GI | 159897719 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
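The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_1190.
length = gene_length(1367183, 1369144)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1962 bp) and Protein Length (653 aa) fields.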
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000537417 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATATCA AACAGCTACT CATCGGGAAG CCGTTCCCCA GCAGTAGTGC CCAGCACGAA CGGCTCGACA AAATTCGCGG ATTGGCAGTC TTTGCATCCG ACCCAATTAG CTCCAATGCC TATGCCACTG AAGCGATTAT GCGGGTGCTG ATTTTGATCA GCGCCGCAGC ATTAAGTTAC ACCTTGCCGA TTGCGATCGG GATTGCGGCC TTGGTGCTGT TGGTCGTCTT GAGCTACAAC CAAACCATTC ACCACTACCC AATGGGTGGC GGCGCGTATA TGGTCAGTAA GGATAACCTC GGTCGCACAG CTTCATGGTT GGCAGGTGCT TCGATTCTTA GTGATTATGT GCTGACCGTG GCGGTTTCGG TTTCGGCGGG GGTTAAAGCA ATTTCTTCGG CCTTCCCCGA TGTCGCATTC TTTCATGAGC ATCGGGTATT AATCGGAATT GGGATTATCC TTTTAATTAC CTGGCTGAAC TTACGTGGTG TGCGCGAAAG CGGCACGATC TTTGCCATCC CCACCTATGC CTTTGTGTTT GGGGTGTTCG TAGTGATCGC GATTGGCCTT GCCCGCTATT TTGGAATTTT CGGCGAGCCA TTGCCACCAC GCAGCGTCGC AACCGACGAA ACCGCTCGTT CAGGCCTTGA TAACTTTGGC TTGATCTGGC TGGTCTTGCG GGCCTTTGCT GGTGGTTGTA CCGCCTTGAC TGGGATTGAA GCAATCAGCG ACGGCGTACA AGCGTTTCGT AACCCAGCGC CAAAAAATGC GATCATTACC ATGCGAGCCA TGGCCGTCAT GGCCATGACC TTGTTTATTG GAATTAGCTT TATTGCTACC CATATTCCGA TTACCCTGCT GCATGAAGGT GGCGAGAGTG TACTTTCGCA AATGACACGC ACGATTGTTG GTAGTGGGTT TCTGTACTAC TGGGTTCAAT TTACCACCAT GTTGATTTTG ATCCTAGCTG CTAACACTGC TTATGCCGAC TTCCCACGGA TCGCGGCCTT CTTGTCGAAT GACGGTTTCT TGCCGCGTTG GCTCTCGCGC TTGGGTAGCC GTTTGGTCTA TAGCTCAGGC GTGATCGCGC TGGCCTTTTT GGCCTCGGCC TTGCTGGCAG CTTTCGGCGG CGAAGAACAT CACCTGTTGC CATTGTATGC GATTGGGGTG TTCCTCTCGT TTACGCTTTC GCAAGCTGGC ATGATTGTGA TGTGGCGCAA AGTTGCCAAA CTCAAGCCAG GCGAAAGCCT CGACACTGGC ATCACCACTC TGCACTATGA GCCAAACTAC AAGCTCAAAC GAATTCCCAG CATCATTGGG GTTGGCTTGA CGGCTGTGGT TTTGGTGGTG TTGACAGTTA CCAAATTTAC CGAAGGTGCA TGGCTGATTA TTGTGGCCTT GCCGTTGATC ATGCTTTTGT TCCGCAAGAT CAAAGCACAC TATGACCATG TGGCAACCAA TTTGAGCCTA ACAGGCTTGA AACCAAGCGA TTTGCGTTCG CCCGCTGATG TGGCGATTGT GCCAGTTGGC AGCATTCATC GTGGCTCGTT GCGTGCGATC AAATATGCCT TGAAACTAAC CGACGATGTG CGCGTGGTCC AAGTTGTTGG TAGCGAAGAG GAAGAAATCA AAACCCGCAA ACGCTGGGAA CAGTGGGACG AAGTGCTGGG CAAGGCCAAA TTGGTCTTCT TGCACACCGA CTACCGCGAT TATCTCACGC CATTGGTCGA TTACGTCGAT CAAGTCAACA ACAAGGAATT TCCAGGCGAT TTGATTACCG TGGTTATTCC GGAGTTTGTG 
CCCGATTCGA CCATGGCCAA AGTGCTGCAC AACCAAACTG CTGTGATGCT ATTGCTGGCA TTGCGCAAAT ATGAAGATGT GGTGGTAATC AGCGTCCCTT ATCACTTGCA CTATATTCCA ACTGGCTCGG AAGATATTGT GGCCCAAAAA CCAGCCGCCT AA
Protein sequence | MNIKQLLIGK PFPSSSAQHE RLDKIRGLAV FASDPISSNA YATEAIMRVL ILISAAALSY TLPIAIGIAA LVLLVVLSYN QTIHHYPMGG GAYMVSKDNL GRTASWLAGA SILSDYVLTV AVSVSAGVKA ISSAFPDVAF FHEHRVLIGI GIILLITWLN LRGVRESGTI FAIPTYAFVF GVFVVIAIGL ARYFGIFGEP LPPRSVATDE TARSGLDNFG LIWLVLRAFA GGCTALTGIE AISDGVQAFR NPAPKNAIIT MRAMAVMAMT LFIGISFIAT HIPITLLHEG GESVLSQMTR TIVGSGFLYY WVQFTTMLIL ILAANTAYAD FPRIAAFLSN DGFLPRWLSR LGSRLVYSSG VIALAFLASA LLAAFGGEEH HLLPLYAIGV FLSFTLSQAG MIVMWRKVAK LKPGESLDTG ITTLHYEPNY KLKRIPSIIG VGLTAVVLVV LTVTKFTEGA WLIIVALPLI MLLFRKIKAH YDHVATNLSL TGLKPSDLRS PADVAIVPVG SIHRGSLRAI KYALKLTDDV RVVQVVGSEE EEIKTRKRWE QWDEVLGKAK LVFLHTDYRD YLTPLVDYVD QVNNKEFPGD LITVVIPEFV PDSTMAKVLH NQTAVMLLLA LRKYEDVVVI SVPYHLHYIP TGSEDIVAQK PAA
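The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGAATATCA AACAGCTACT CATCGGGAAG"))
```

Applied to the first three blocks of the gene sequence, this yields `MNIKQLLIGK`, which is the first block of the protein sequence above.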