Gene Haur_4641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4641 
Symbol 
ID5736488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5929015 
End bp5931447 
Gene Length2433 bp 
Protein Length810 aa 
Translation table11 
GC content54% 
IMG OID641281805 
Producthypothetical protein 
Protein accessionYP_001547400 
Protein GI159901153 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0331671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCTT TTTTTCGTCG GCAACGCCCG ACATCCCTCC GCGAACGAGG TAAAGGCAAA 
GTTCGCTGGA CAATCTGGGA GTTAACAGCG CTGGTTGTTA GCATCGTTAT GATGGCAACC
CCTTTGTTAG CTGCGATTAG CAGTGCTATT TTTCCATTTA ATGCTCAGGC GCAAGCCACG
ATTCGGCCAA CGTCAGCAGC AACACCGGTT ACGCCGTTAC CACCAACCGC CACCCAGACC
TATCCGGTGC CAACCGAAAC ACCGCTTGTC ACGCCGTGTC CAACGTTTGA GCCAACCGAA
ACACCAACCG AAACCGCCAC AACCTATCCT ACACCTGGCA CACCTGGCAC ACCGACCGAT
ACGCCAAGTG TCACGGATAC GCCAACGACA ATTATTACGT CAACCAATAC ACCAGCTGTG
ACCGATACGC CGAGCGTCAC CGACACCCCA AGTGTGACAG ACACGCCGAG CGCACCAACG
GCTACGACGA TCATCACACC AACTGAAACC AGTGTTGTAA GCCCAACGAT ACCCCGTGGT
GGGATGAGTG GAAGCACGCG TTTGCATCGG CCTTTAGCCC AAACGCCTGA TCCATGTGCA
ACACCACCAA CGATTGTCAT TCCACCAACC GATACCATTC CACCTGATGA AACACCAACG
GGCACAATCA CTGGTCCGGT TACACTGACC CCAGCAACGC CAACCATTGA TTGTTTGGTT
GCAAGTTGTA CGCCAGTTCC AACCCTACCA ACTGATATTG CCACGGCAAC CGAAACCACC
TTGCCAGCCG ATGAGCCAAT CGCGGTGGGC AAGAGTAGTT CGCGGGCGCT CGTGCAGCCT
GGCGAAACAT TTAGTTATAT CATCACGGTA AGTTTCCAAG ATAATGGCGA TGGGCAAACT
TCGCGCTCAG TCAGCATTAG CGACCCATTA CCAAGCCAGG TAACCTTTAT TGCAGTTCAA
CAACTTGGCA CAGCGACCTG CGTTGGTGGC ACAACCGTCA ACTGTAATGG GACGGTCAGT
GCAGGTAACC CAATTGTGGT CACGATTCAA GTCCAAGTTA ATGCCAGCGT GGCATTGGGC
ACGAATATCG TTAATATCGT TAGTGCCACG GCGGCCAATC GCACGTTGCA AGCAAGCGAT
ACCGTGATTG TGCCTGATAC CTTACCAACC AGCACGCTTG GCACAGCTGG CCCAAGCTTC
ACGCCAATTG TTGTGACCAA TACACCAATT ACGCCAATTG TGACCAATAC TCCGATTACG
CCAATTGGTG TAACCAATAC TCCACCTACC AGCACACCAG TCACACCAAT TGGTGCAACC
AACACACCTG GTGGCTCAAC CAGCATACCT GTTACGACTG GTCCAAGCAA TACGCCACGG
CCAAACCAGC CGAGCAATAC GCCACGGCCA AATCAGCCGA GCAATACGCC ACGGCCAAAC
CAGCCGAGCA ATACGCCACG GCCAAATCAG CCAAGCAACA CACCACAGCC GAATCAGCCA
AGCAACACAC CACGGCCTGA TATTACGCCA GTGCCAGCAA CCAATGTGCC TGTTGCCACG
GTTGTTCCAC CAAGCAACCC AAGCGCAACG CCACGCCCAG GTGTGCCAGT GCCTTCGGCA
ACTCAGCGCC CAGGCGGTGG TTCAAATCCA AGTGCAACGC CACGGCCAGG CGCACCAGTG
CCTTCGGCTA CCAATGCACC AGCTGGCAGC CAACCAACCA ATGCACCCGC GCCAGCTACA
GCAACGCCAA CCAATCCAGC TGGCTTCGTC ACCGATCCAA TCGTGACTGG CTTGCAGTTC
CAAAAGAAGA GCGATTGGGG CAGCCGCTTT GCAGGCGAAA GTTTGATTTA CACAATCACG
ATTATCAGCC CAACTAATTC GTTGAATGCT GGTACGATGC GTGATGTCGT GGTGGTTGAT
CAATTGCCAA GCAACTTGGA AACCAATGGC CCAATCAAGG TCAGCGACCA AAATGCACGG
GTTGAACAAC AAGGCAACCA AATTACCGTG CGGGTCGGGG TCTTGCCAGC AGGCCAAACC
TTGACAATCA TGATTCCAGT CAAGATCAAA GATGGTGTGG CTGCTCAAAC GCGGATCGTC
AACCAAGCTC AGTTGAATTT CACTGGCTTG GCCCAGCCAA TCTATTCGAA TATTTCGAGT
GTTTTGGTGG TCGGCGAAGC TCCTGCAGTC AGCGCCACAG CAGTTCCTAA GGGCAATGTC
GGCGGCGGTG CTGCAACTGC CAACCCAGCA ACCGTCACGC CAAATACTGG CATTGGCGGC
GGCCAAGGTA GTGGCGATGG CACTGGCGCA ACCGATGTTG GGGTTAGCAA CCCAGCTACG
AATATGGGTA TTCCAGCAGC AGGCTTTGTG CTCTTCGCCC TGACGATGTT CGTTCACGTT
ATACGGGTTC GCCGCGAAAT GACGCGGATC TAA
 
Protein sequence
MKSFFRRQRP TSLRERGKGK VRWTIWELTA LVVSIVMMAT PLLAAISSAI FPFNAQAQAT 
IRPTSAATPV TPLPPTATQT YPVPTETPLV TPCPTFEPTE TPTETATTYP TPGTPGTPTD
TPSVTDTPTT IITSTNTPAV TDTPSVTDTP SVTDTPSAPT ATTIITPTET SVVSPTIPRG
GMSGSTRLHR PLAQTPDPCA TPPTIVIPPT DTIPPDETPT GTITGPVTLT PATPTIDCLV
ASCTPVPTLP TDIATATETT LPADEPIAVG KSSSRALVQP GETFSYIITV SFQDNGDGQT
SRSVSISDPL PSQVTFIAVQ QLGTATCVGG TTVNCNGTVS AGNPIVVTIQ VQVNASVALG
TNIVNIVSAT AANRTLQASD TVIVPDTLPT STLGTAGPSF TPIVVTNTPI TPIVTNTPIT
PIGVTNTPPT STPVTPIGAT NTPGGSTSIP VTTGPSNTPR PNQPSNTPRP NQPSNTPRPN
QPSNTPRPNQ PSNTPQPNQP SNTPRPDITP VPATNVPVAT VVPPSNPSAT PRPGVPVPSA
TQRPGGGSNP SATPRPGAPV PSATNAPAGS QPTNAPAPAT ATPTNPAGFV TDPIVTGLQF
QKKSDWGSRF AGESLIYTIT IISPTNSLNA GTMRDVVVVD QLPSNLETNG PIKVSDQNAR
VEQQGNQITV RVGVLPAGQT LTIMIPVKIK DGVAAQTRIV NQAQLNFTGL AQPIYSNISS
VLVVGEAPAV SATAVPKGNV GGGAATANPA TVTPNTGIGG GQGSGDGTGA TDVGVSNPAT
NMGIPAAGFV LFALTMFVHV IRVRREMTRI