Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4038 |
Symbol | |
ID | 5735900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5154098 |
End bp | 5155888 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281189 |
Product | hypothetical protein |
Protein accession | YP_001546798 |
Protein GI | 159900551 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGTTGT TGTTACGGCA ACGTGTGAGT TTAGCCTTGG TTGTTGTGGC GCTCGCAGCA TGTCAACAAG CCCCACAGCC CACTGCTCGC CCGTTACAAA ATGAGCTAAC TATTCTCGCC TTAACTCCGA CAGGCACGGC TACTCCTACG GCAACGGTGA CATTAACGCC TGCACCTGCG TCGCCAACAA CCGAGCCTTC GGCTACCCCA ACCCGCACAG TTGGCCCTTC GCCTACCAAA GGCGCTTCTC CAACCGCTGG CCCTTCACCA GTCGGTACAC CTCGGGCCAC TCCCGCGAGT ACCCCAACTG CCAATCAAAG CAGTAGTTTT TGTACGCAGC CATTTGGGGC GGTAACCGAC GAGCGCTTCA GCGCCCGCTT AAATAACGCT GGGCTTGATC GCACGCCCAA TGGCGATCGC TTATTCTTAG AATTAACCTC CAGCAGCGGC CCCGTCAACG GTGTCGTGCG CTGTGTGCCT CCAGCAGCGG CCCAATTGCT CGCAGGCGAT AGCGCAATTG CCAGCGTGAT TCAAATCGAT CTGCCGCTAT GGCGACACGA TGATCTCTGG CGTTCATCGT CGGTCACGCT CACCAAAGTG CTCAAACTTG ATACGCTGCA ACATGTACGC TCAGTCGTTT CGCAATCTAG CAGCGATTCA GCTGGGGTAT TAATTGAAAT TGGGCTGGAT CAGGCTTTGC CATTTACGGT GCAGCTCGAT GGCGGACGCT TAAATGTGGT GATTGCTGAT AGCGCTACCG CCACGCTGGG CGATGATCCA CTAGCCAAGA GCAATGGTTC ACCCAGCGCA CCCAAGCAAC CAGTTGTGTT TGCCAGCAAA GGCGATTTGT ATCGCTACGA AAGCTCGCGA GTCGTGCCAA TTACCACAAC CTTGGCAATT GAAAGTGCAG TGGCAATCAG CCCTGATCGT ACCCAAATTG CCTTCTGTCG CGCCAACCCC GATGGCTTGC CAACCCAAGG CGCACTCTGG ACCAGCACAA TTGATGGCGA TAATGAAACC TTGGTCGCTG ATGTTGGTGG TTGTGCCGAG CCTGCTTGGT CGCTTGATGG CGGGATCATT TGGTTTACTG CCCCTTGGAG CGATGCGGCC CCTGATAGCT ATCGACTCTG GCAGGTGAAA GCCAATGGCG GCGATGCGAG CGCGGTCTCG CCGCTCGACG AGTGGAGCCG CCGTATGCCG CATGCCTTGC CCGATGGCTC AGTACTGACG GTTGGCCATA CCGATGGCGG TCAAGGTGGC TTGTTGATCA GTAATCCGTT GAGTGGCACT GATGGATTGC TTGGCCAAGC GAGCTTGGGC AATTATCGCA GCGTTGGTCA AGCCCAAGTT TCGGCTGATG GCACCCGGAT TGCCGTCGAA GCGCTCCGAG CTGATGGTGG CGCGGATCTC TTGGTGCTCG ACCAAACTGG GAAGCAACTC GATGCAATTA CCGACCAATG GTGGGTACGC CCGCTCTCGT GGAGCAGCGA TAACAAACTC TACTATTTAA ATGTGGCTTG TCGTAGCGGC CAAGTATTGA ATTATAGCCT GCACAGTCGC CAAGGCAGCA ACGATAGTCA AATTATCAAA GGTGCAACCC TTGGCGATTT AGGTTCAGTC GCTGTGGTTG ATGATGCGCT GTTGTATGTT CGGGCTTTAC AATCGCCCGA CAACGAACGT GGTGCAGAGC CTATGATTAG CGGCCCTAGC GAGTTGTGGT TGTATGATCT TTCGAATACA GCTCGTACCC GCCTGATTGC CGCCGATGAT GGAATTACCA GCGTCAAGTA A
|
Protein sequence | MKLLLRQRVS LALVVVALAA CQQAPQPTAR PLQNELTILA LTPTGTATPT ATVTLTPAPA SPTTEPSATP TRTVGPSPTK GASPTAGPSP VGTPRATPAS TPTANQSSSF CTQPFGAVTD ERFSARLNNA GLDRTPNGDR LFLELTSSSG PVNGVVRCVP PAAAQLLAGD SAIASVIQID LPLWRHDDLW RSSSVTLTKV LKLDTLQHVR SVVSQSSSDS AGVLIEIGLD QALPFTVQLD GGRLNVVIAD SATATLGDDP LAKSNGSPSA PKQPVVFASK GDLYRYESSR VVPITTTLAI ESAVAISPDR TQIAFCRANP DGLPTQGALW TSTIDGDNET LVADVGGCAE PAWSLDGGII WFTAPWSDAA PDSYRLWQVK ANGGDASAVS PLDEWSRRMP HALPDGSVLT VGHTDGGQGG LLISNPLSGT DGLLGQASLG NYRSVGQAQV SADGTRIAVE ALRADGGADL LVLDQTGKQL DAITDQWWVR PLSWSSDNKL YYLNVACRSG QVLNYSLHSR QGSNDSQIIK GATLGDLGSV AVVDDALLYV RALQSPDNER GAEPMISGPS ELWLYDLSNT ARTRLIAADD GITSVK
|
| |