Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0909 |
Symbol | |
ID | 5732810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1040964 |
End bp | 1043405 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278041 |
Product | hypothetical protein |
Protein accession | YP_001543685 |
Protein GI | 159897438 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3127] Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTGG GTTTTACCTT TAATTATGCT TGGCGCTCGT TACGCTTGGG TGGTCAGCGC ACGCTGTTGG CAATTATCTG TATTGGCTTT GGCGTGATGT CGCTTGGCTC GATGCAAAGT TTATCGACGG CGATCAATCA AATTTTTATT GAAAATCGGG TTAGTGTTGG CGGCGATGCC ATGCTCGATT GGCCTAACGG CTCAATTGGC CCTGAACAAC AAGCCCAACT TGAACAATGG AAACAAGCAG GCGTGATTGG CGGCTATGTG GTTTATTCGG CAATTCCGCC TGGGTTGCTC AAGCCTGCTG GCGGCGATCA TGTGATTTTT GGTGATATTG GCTATGGGAT TGATCCACAG AGCTACCCGT TGTTGGGCGA ATTGCATGTG AGCCAACCCG CCAACGCCAG CATGAGCGAG TTGCTGAGCC AGCCCGATGC CTTGGTGGCT ACTGAGCTTT TGGCAATTCA GCATGATCTG ACGATTGGCC AAGAACTGGT GATTTTGGTT GATCAAGGGG CAACGCCGAA ACGCCTGAAA TTGGTTGGTT TGGTGGATCA AGTGCCGAAC AAGAGCGCTG GTAGCTTATT TTTTAGCCTG GCGACGGCCC ACGAAATTTA CCCTGAAGCC AATTTGAACT CAGTGAGGGT GGTTTGGGGC CAGCAAGGTG TGCAAGTTGG CGTGCTAGAG CAAGCTGGTT GGTCGGTTGC CACCGTCAGT AGCGAGCCAT CGGATGCCGC TGGTTTATTC AATATGACCT TGCGCGGTGC TGGCGTGTTG GGCTTGATCG TCAGCGGAAT TGGCGTGGCC AATACCATGC AAGTTGTGCT TGCACGTCGC CGCAACGAAA TCGCCATCCT CAAAACCTTA GGCTATCGTG GCCCACAATT GTTGTTGCTG TTTGGCTTTG AAACGGCGTT GCTTGGACTG ATTGGCAGTA TCTTGGGTGC TGTGGCCGCC GTATTAATTG GCGATCAACT CACCGATTTG TTTGCACGTA GCAGCGCCTA CATCCTACCA ACCGTCGTCG ATTGGCAAAT TTTGGGTGGG GCAATTGGCT TGGGCATCGC CACAACCTTG ATCTTTGGCA TGGTGGCGAT TGTAAAAGCC AATGCGGTAC GACCTGGCTC GTTGCTACGT TCTGGCCCGA TCGAAGTTAA CCCGACGACC CGCCGCGCCA TGGTTGGGTT GTATACAGCC CTAGGGGCCA TGTTTGCCGT GGTTGCTAGC ATTAGCATGG GTTCGTTTGT TGCGGGCATG CTCTTGTTTG CGGGAGCGGT GTTGGGCTTG GCGTTGCTCA ATTGGCTGTT TCAAGCGATT TTGTTGTTGG TTGCCAAAAC GCCGTTGCCT GGCTATTGGC TGCGTTTGGC CAGTCGCAAT ATGCAACGCA ATGGTCAGCG GGCTGCCTTT GCAATTATCG CCTTGGCGAT CGGGGTGTTT ACGATTGGCT TTTCGGCAGC AGCCTTATTG ACGGTCCAAA AAGAACTTGA TGCCCGCGCC GATCCAAATA CCAATCTCAA TCGGGCGATG TGGGTGATTA CTGCCCCCAG CGAGGCCTCC AAGGTTGCCC AAACCTTGCA ACAGTTGCAG CAACCAACGC TTGAGCCAAT TCAATTGTTG GAAGTTGAGA CTTCGTTGCC CGAGCTTGAA GGCGTGAAAT TTGAAACCTC GGCCCGCTCA GCCACGCAAA TCGGCGATAT TCAGCTGACC GAAGGCCAAT GGCAGGCTGC GGCTGATCAA GTGATGTTTT CAACATGGTT TTTCAGCGAA ATTCCGCTTG GCACTGAGCT AGAATTGATT GGATCGAACG GTCGATTAAA CGTTCGCTTG GTTGGGCGCT ACGAAAATCA AGCTAGCCAA GCATCGAGCG CCATTGTGAT TGGCCCTGAC GCTGTGTCGC AATTGGTTGC ACAACCTGTT ACGCTGATTT ATGTGCTCAA TCTGCCGATT AACCAATTGA GCACTGCGAC CAACACGTTT AATCAGGCGA TTCCCCAAGC CTTTGTTTAT AACGAAGTTG AAATGTATAA CACTACTCAA ATGATCTATC GCTCGTTAGG CTTGTTTGTG GTGGCGGTGG CTGGCTTGGC GTTTGTGGCG GGCATGGTGC TGATCGCCAA TGCCGTTGGT TTGGCATTGT TTGAACGTCG CCGCGAGATG GGCATTTTCA AGGCGGTTGG CTATAGCACT GCGCACTTAT TGCGCAGCAT CAGCATGGAA TATAGCTTGG TCGGCTTGAT TGCTGGTAGC GCTGGTATGC TGGCAGTTTG GCTTGCGATC ACAGTGATTA ATACGCTTGA ACCCAAGGCT GGGCTTGGGC TAGATGCCTT GCCAGGTTTG CTGATTTTTG GCTTTGCAAT CGGCCTAGCG CTGTTGACTG CCTTGGGGGT TGCTTGGCGA CCCGCCCATC TGCGCCCGCT GCATGTGCTG CGCGACGAAT AA
|
Protein sequence | MGLGFTFNYA WRSLRLGGQR TLLAIICIGF GVMSLGSMQS LSTAINQIFI ENRVSVGGDA MLDWPNGSIG PEQQAQLEQW KQAGVIGGYV VYSAIPPGLL KPAGGDHVIF GDIGYGIDPQ SYPLLGELHV SQPANASMSE LLSQPDALVA TELLAIQHDL TIGQELVILV DQGATPKRLK LVGLVDQVPN KSAGSLFFSL ATAHEIYPEA NLNSVRVVWG QQGVQVGVLE QAGWSVATVS SEPSDAAGLF NMTLRGAGVL GLIVSGIGVA NTMQVVLARR RNEIAILKTL GYRGPQLLLL FGFETALLGL IGSILGAVAA VLIGDQLTDL FARSSAYILP TVVDWQILGG AIGLGIATTL IFGMVAIVKA NAVRPGSLLR SGPIEVNPTT RRAMVGLYTA LGAMFAVVAS ISMGSFVAGM LLFAGAVLGL ALLNWLFQAI LLLVAKTPLP GYWLRLASRN MQRNGQRAAF AIIALAIGVF TIGFSAAALL TVQKELDARA DPNTNLNRAM WVITAPSEAS KVAQTLQQLQ QPTLEPIQLL EVETSLPELE GVKFETSARS ATQIGDIQLT EGQWQAAADQ VMFSTWFFSE IPLGTELELI GSNGRLNVRL VGRYENQASQ ASSAIVIGPD AVSQLVAQPV TLIYVLNLPI NQLSTATNTF NQAIPQAFVY NEVEMYNTTQ MIYRSLGLFV VAVAGLAFVA GMVLIANAVG LALFERRREM GIFKAVGYST AHLLRSISME YSLVGLIAGS AGMLAVWLAI TVINTLEPKA GLGLDALPGL LIFGFAIGLA LLTALGVAWR PAHLRPLHVL RDE
|
| |