Gene Haur_5045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5045 
Symbol 
ID5737004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp60328 
End bp62916 
Gene Length2589 bp 
Protein Length862 aa 
Translation table11 
GC content57% 
IMG OID641282212 
Productlipid transport protein 
Protein accessionYP_001547803 
Protein GI159901557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0150437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGGA TCGTTAACAG ACTGATATCA TTGGCCCTGA TCAATGTTGT TATGCTTTCG 
CTCGTCTTGT TTCCCAGTTC CACACGTGCG TCCGTAACCG CACTGCGGTC ATACGGCTAT
CGTGTGGGCA GCCAATACGA CTATACTTGG AGCATGGATG TCACGACCAG TGGGACTAGC
CAGAACCAAA CCGGAAAGAC GACCCAAACT CGTTTATCGC GCATCGTCGG CAGCGTCACC
ATTATTCCAT TGGGTCCCAA GGACGATGGG TCAATGGTAC TTGAGGCATT AGCCAGCGAT
ATCCGCGCCT TTACGACCAA CACCGAAGGT GTCTTGATTG AGGCCACCGA GGTTCCTAGC
ACCACCATCC ATCCGGAGTT GCCCTTTTAC TTCGCCCAGA ATCCCTCTGG CCGCGTAACT
ATGGTCTTGC TCAACACCGA GGAATCCCGT GAATCGGCCA ATTTCAAGCG CGGCATAGTC
TCGCTCTTCC AAATGCAATT GTCTGGTCCC GCCGAGAGCC TTGCCGAAGA CGACTCCTCG
GGTGTCTACG AGGCGCAATA CCGTACCACC GATCAGGCGA CAACGGTCAC CATTACCAAA
GATCGCACTC AGCAGAACTA TCAGGTATTC GCTGATTCCT CAATCGCCGC GCACGGCAGT
TCGGCACCAT CGAGTGCCCT CGGAGTACCC GCTACGTCAC GCGCTGAGAA TCTAGCGATT
AGCGACAAGA GCACCGCTGT GTTTGATCGT AACGAGCGCG TGCTTCGCTC TGTCGCCGTA
ACAAGCAGTG TTGCTGGTAC CACTGATGGC TACAGCGGTG AGGGTGTCGC CATGACCTCC
GCAGCGTCCC TAAAAGGTAT ACTCACCCTG CAATCGATTA GCCCACGCGC TGGACGCGCT
TTCCCGACCT TTCCGGCGAC CGCCACGACC AACGACGTGA TTACGGCGAT CCAAACCTTC
CGCCGGGTCA CACTCGCCAG CAGCTCGATG GCCGGTGTCG TTGCAGATCT GACGCCGCCC
ACCGATGAGG TCGCAACCCT AGACGAGGCG CTGGCTGTAC TCTCCGCTCA GCCCGACGAC
CTTGCCGCAA CGAGTACCCT GCGCGAGACC TTGCAGGCTA ACGTCAATCG CCTGTCCGAG
CTGGATATGG CCTTACTCAA CGGCACAGTG GCACCCGTGC TCTATTCCGG GATCATCGTG
GCCTTGAGCG GCGTGTCCCA TCCGCAGGCA CAGTCGATTC TGCTTTACCG TTTTATGGCA
AACTCCGCGC TTGACGTGGC GGTGCGGAAT CGCGCACTGA TGGCCGTCGT AGGGATCAGT
GTACCCACCA GCGAGCTGAT TCAGGGTGTG CAACGTTTAA GCGAAAGCGG CCCGCTGTCC
GAACAGGCGA CGCTGATCCT AGGTGCGCTT GCTACCAATG TGGAGCCAAG CCGCCCCCAG
GATGCCGCAC AGATTGTGAC CGTGCTCAAA GGCCGCCTAG CTACTGCCCA GACCCCGGAG
ACGATCATCT TGGCCCTTGA TGCCTTGGGT AACGCAGGAC CACTGGTTGA GCTGAGTGTG
ATCACGCCCT ACCTGAACAA CTCCGCGTTC GAGGTTCGCA AATCGGCGCT GGATGCCTTG
CGCGGCTATC CCTACGAGCA AATCGCCTCG ACGATTAATC AGGTGTTGGT CAGTCCAGAC
CACGCCCTGC GAGCTGAGGC AACGGCGATT GCCGCTGCCT CAGGAGTAAT CTTGCCACTG
GACGATTCGG CTGTTAACGC CCAGCCAGGG GAATATCACA AGGAGTGGCA GAACTTCATC
GGCGGTGGCG ATGTTAAAGG CGAATTGCTG GCGAATCTTC ACTTCTGGCC CAATGATCCC
TTCAATCCGG GCTGGATGTC CACGGAATCG CACATGAAAG CCCGCTTCTG GTCGCAAACC
GTCTCGGTAG CACGGGCAAA AGCATGGAGT TATATGACCT CGGCAACAGC CACGACGCGC
TATTATCGTG CGCAATTCTA TCTGCTGGGC TTCAAGGTCT ATGACTTCAA CAAGACCCTG
CCCTGCCAGT CCTCAACCAG TGGCAATATG TTCAACGAGT CGCTGACCTT CTTCCACTTA
GAGCAAACCA TTTTTGTCTA CGGTATTCCG ATCACGCTGG AGGCACTCGC GTCGGCATCG
GTGACTGTCC CTTGGACGAT CGGGGTCACG AACTGCAGCA ACCCATTGCA TGCCCAGGCG
TGGGGCAACA TAACGCCCCA AGTGAAAACA TACGTGGAGG GTTCTGCCTC GGTCGACCTA
GCTGTCTTCC GCGCGGGCAT TGGCGTGGGT GCGGAGCTAT TCGACACCAG CCTCTCGATG
AACGCGATGG TCTTGGCAGC GCTGGCACCC AATCCAGGGC TGGGTCTCTC GATCAACGTA
ACCGGTCATT TTGAGCCGAT CAACGCTCAT ATTTTTATCT TCTATCAGTG GCGGACGTTC
AAGTGCAAGA AGAAGCTCTT TGGCGTTTGC GTATGGCCAA CTTTCCCTTG GGGAACCAAG
CACTATACGG ATCTGTGGAA TTATCACGGC CCGGCTTACG ACTGGACTAT TATCAACTTC
AACAATTGA
 
Protein sequence
MQRIVNRLIS LALINVVMLS LVLFPSSTRA SVTALRSYGY RVGSQYDYTW SMDVTTSGTS 
QNQTGKTTQT RLSRIVGSVT IIPLGPKDDG SMVLEALASD IRAFTTNTEG VLIEATEVPS
TTIHPELPFY FAQNPSGRVT MVLLNTEESR ESANFKRGIV SLFQMQLSGP AESLAEDDSS
GVYEAQYRTT DQATTVTITK DRTQQNYQVF ADSSIAAHGS SAPSSALGVP ATSRAENLAI
SDKSTAVFDR NERVLRSVAV TSSVAGTTDG YSGEGVAMTS AASLKGILTL QSISPRAGRA
FPTFPATATT NDVITAIQTF RRVTLASSSM AGVVADLTPP TDEVATLDEA LAVLSAQPDD
LAATSTLRET LQANVNRLSE LDMALLNGTV APVLYSGIIV ALSGVSHPQA QSILLYRFMA
NSALDVAVRN RALMAVVGIS VPTSELIQGV QRLSESGPLS EQATLILGAL ATNVEPSRPQ
DAAQIVTVLK GRLATAQTPE TIILALDALG NAGPLVELSV ITPYLNNSAF EVRKSALDAL
RGYPYEQIAS TINQVLVSPD HALRAEATAI AAASGVILPL DDSAVNAQPG EYHKEWQNFI
GGGDVKGELL ANLHFWPNDP FNPGWMSTES HMKARFWSQT VSVARAKAWS YMTSATATTR
YYRAQFYLLG FKVYDFNKTL PCQSSTSGNM FNESLTFFHL EQTIFVYGIP ITLEALASAS
VTVPWTIGVT NCSNPLHAQA WGNITPQVKT YVEGSASVDL AVFRAGIGVG AELFDTSLSM
NAMVLAALAP NPGLGLSINV TGHFEPINAH IFIFYQWRTF KCKKKLFGVC VWPTFPWGTK
HYTDLWNYHG PAYDWTIINF NN