Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5045 |
Symbol | |
ID | 5737004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 60328 |
End bp | 62916 |
Gene Length | 2589 bp |
Protein Length | 862 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282212 |
Product | lipid transport protein |
Protein accession | YP_001547803 |
Protein GI | 159901557 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0150437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGGA TCGTTAACAG ACTGATATCA TTGGCCCTGA TCAATGTTGT TATGCTTTCG CTCGTCTTGT TTCCCAGTTC CACACGTGCG TCCGTAACCG CACTGCGGTC ATACGGCTAT CGTGTGGGCA GCCAATACGA CTATACTTGG AGCATGGATG TCACGACCAG TGGGACTAGC CAGAACCAAA CCGGAAAGAC GACCCAAACT CGTTTATCGC GCATCGTCGG CAGCGTCACC ATTATTCCAT TGGGTCCCAA GGACGATGGG TCAATGGTAC TTGAGGCATT AGCCAGCGAT ATCCGCGCCT TTACGACCAA CACCGAAGGT GTCTTGATTG AGGCCACCGA GGTTCCTAGC ACCACCATCC ATCCGGAGTT GCCCTTTTAC TTCGCCCAGA ATCCCTCTGG CCGCGTAACT ATGGTCTTGC TCAACACCGA GGAATCCCGT GAATCGGCCA ATTTCAAGCG CGGCATAGTC TCGCTCTTCC AAATGCAATT GTCTGGTCCC GCCGAGAGCC TTGCCGAAGA CGACTCCTCG GGTGTCTACG AGGCGCAATA CCGTACCACC GATCAGGCGA CAACGGTCAC CATTACCAAA GATCGCACTC AGCAGAACTA TCAGGTATTC GCTGATTCCT CAATCGCCGC GCACGGCAGT TCGGCACCAT CGAGTGCCCT CGGAGTACCC GCTACGTCAC GCGCTGAGAA TCTAGCGATT AGCGACAAGA GCACCGCTGT GTTTGATCGT AACGAGCGCG TGCTTCGCTC TGTCGCCGTA ACAAGCAGTG TTGCTGGTAC CACTGATGGC TACAGCGGTG AGGGTGTCGC CATGACCTCC GCAGCGTCCC TAAAAGGTAT ACTCACCCTG CAATCGATTA GCCCACGCGC TGGACGCGCT TTCCCGACCT TTCCGGCGAC CGCCACGACC AACGACGTGA TTACGGCGAT CCAAACCTTC CGCCGGGTCA CACTCGCCAG CAGCTCGATG GCCGGTGTCG TTGCAGATCT GACGCCGCCC ACCGATGAGG TCGCAACCCT AGACGAGGCG CTGGCTGTAC TCTCCGCTCA GCCCGACGAC CTTGCCGCAA CGAGTACCCT GCGCGAGACC TTGCAGGCTA ACGTCAATCG CCTGTCCGAG CTGGATATGG CCTTACTCAA CGGCACAGTG GCACCCGTGC TCTATTCCGG GATCATCGTG GCCTTGAGCG GCGTGTCCCA TCCGCAGGCA CAGTCGATTC TGCTTTACCG TTTTATGGCA AACTCCGCGC TTGACGTGGC GGTGCGGAAT CGCGCACTGA TGGCCGTCGT AGGGATCAGT GTACCCACCA GCGAGCTGAT TCAGGGTGTG CAACGTTTAA GCGAAAGCGG CCCGCTGTCC GAACAGGCGA CGCTGATCCT AGGTGCGCTT GCTACCAATG TGGAGCCAAG CCGCCCCCAG GATGCCGCAC AGATTGTGAC CGTGCTCAAA GGCCGCCTAG CTACTGCCCA GACCCCGGAG ACGATCATCT TGGCCCTTGA TGCCTTGGGT AACGCAGGAC CACTGGTTGA GCTGAGTGTG ATCACGCCCT ACCTGAACAA CTCCGCGTTC GAGGTTCGCA AATCGGCGCT GGATGCCTTG CGCGGCTATC CCTACGAGCA AATCGCCTCG ACGATTAATC AGGTGTTGGT CAGTCCAGAC CACGCCCTGC GAGCTGAGGC AACGGCGATT GCCGCTGCCT CAGGAGTAAT CTTGCCACTG GACGATTCGG CTGTTAACGC CCAGCCAGGG GAATATCACA AGGAGTGGCA GAACTTCATC GGCGGTGGCG ATGTTAAAGG CGAATTGCTG GCGAATCTTC ACTTCTGGCC CAATGATCCC TTCAATCCGG GCTGGATGTC CACGGAATCG CACATGAAAG CCCGCTTCTG GTCGCAAACC GTCTCGGTAG CACGGGCAAA AGCATGGAGT TATATGACCT CGGCAACAGC CACGACGCGC TATTATCGTG CGCAATTCTA TCTGCTGGGC TTCAAGGTCT ATGACTTCAA CAAGACCCTG CCCTGCCAGT CCTCAACCAG TGGCAATATG TTCAACGAGT CGCTGACCTT CTTCCACTTA GAGCAAACCA TTTTTGTCTA CGGTATTCCG ATCACGCTGG AGGCACTCGC GTCGGCATCG GTGACTGTCC CTTGGACGAT CGGGGTCACG AACTGCAGCA ACCCATTGCA TGCCCAGGCG TGGGGCAACA TAACGCCCCA AGTGAAAACA TACGTGGAGG GTTCTGCCTC GGTCGACCTA GCTGTCTTCC GCGCGGGCAT TGGCGTGGGT GCGGAGCTAT TCGACACCAG CCTCTCGATG AACGCGATGG TCTTGGCAGC GCTGGCACCC AATCCAGGGC TGGGTCTCTC GATCAACGTA ACCGGTCATT TTGAGCCGAT CAACGCTCAT ATTTTTATCT TCTATCAGTG GCGGACGTTC AAGTGCAAGA AGAAGCTCTT TGGCGTTTGC GTATGGCCAA CTTTCCCTTG GGGAACCAAG CACTATACGG ATCTGTGGAA TTATCACGGC CCGGCTTACG ACTGGACTAT TATCAACTTC AACAATTGA
|
Protein sequence | MQRIVNRLIS LALINVVMLS LVLFPSSTRA SVTALRSYGY RVGSQYDYTW SMDVTTSGTS QNQTGKTTQT RLSRIVGSVT IIPLGPKDDG SMVLEALASD IRAFTTNTEG VLIEATEVPS TTIHPELPFY FAQNPSGRVT MVLLNTEESR ESANFKRGIV SLFQMQLSGP AESLAEDDSS GVYEAQYRTT DQATTVTITK DRTQQNYQVF ADSSIAAHGS SAPSSALGVP ATSRAENLAI SDKSTAVFDR NERVLRSVAV TSSVAGTTDG YSGEGVAMTS AASLKGILTL QSISPRAGRA FPTFPATATT NDVITAIQTF RRVTLASSSM AGVVADLTPP TDEVATLDEA LAVLSAQPDD LAATSTLRET LQANVNRLSE LDMALLNGTV APVLYSGIIV ALSGVSHPQA QSILLYRFMA NSALDVAVRN RALMAVVGIS VPTSELIQGV QRLSESGPLS EQATLILGAL ATNVEPSRPQ DAAQIVTVLK GRLATAQTPE TIILALDALG NAGPLVELSV ITPYLNNSAF EVRKSALDAL RGYPYEQIAS TINQVLVSPD HALRAEATAI AAASGVILPL DDSAVNAQPG EYHKEWQNFI GGGDVKGELL ANLHFWPNDP FNPGWMSTES HMKARFWSQT VSVARAKAWS YMTSATATTR YYRAQFYLLG FKVYDFNKTL PCQSSTSGNM FNESLTFFHL EQTIFVYGIP ITLEALASAS VTVPWTIGVT NCSNPLHAQA WGNITPQVKT YVEGSASVDL AVFRAGIGVG AELFDTSLSM NAMVLAALAP NPGLGLSINV TGHFEPINAH IFIFYQWRTF KCKKKLFGVC VWPTFPWGTK HYTDLWNYHG PAYDWTIINF NN
|
| |