Gene Haur_3941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3941 
Symbol 
ID5735802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4937690 
End bp4940035 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content50% 
IMG OID641281092 
Producthypothetical protein 
Protein accessionYP_001546703 
Protein GI159900456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.254734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCGCA ATAGGAAGCG CAGACGAGTA CTATCCTCGT TTCTTATTAC TCTTACACTT 
TTGGTCACCT TTATCTCTTC GGTTTCGGCA ATCTCACTCA ATGCTACCGT TTCACTCAGC
TCCGGTGATT TTGCCAGCGG CTATTTTGGT CTAACCGGCC TAACCCAACA AGATCAGGTG
ATCGGTGATG TTACCTATGG CGGGGTGCAG TTGATCCCTC AGGGTGCGTT GGCGCAATGG
AGTGATGCCA GTAATCAACT TTGTCGGACT TTGGCCGATA TGGGGACTGT TAGTTTCAAG
CAGCACCTCT ATGCGATTGG TGGCTCGACC GCATCAAGCG GGAATGTAGC GGTTGTGTCT
GAGGTCTGTC GAGCAACCGT CACTGATACT GGGGGTGAAA CGACCGAGTG GACTGAACTT
CCTCAAACCT TACCAGTTCC CCTGACGCGG ATGAGTACGG TGGTGGTGAC GAACACCAGT
GATCCTACTA AGGGGATTAT GTATACCTTT GGTGGCCAAA GCGCCTCAAC CGGCGATATT
GAGTATACCG ATAAGATTTA TTCGAATGTG ATCAATGCTG ATGGTTCATT GAACACTTGG
CAAACTCAGG CGTTGACCAC TGGCGAAAAA CTGATCAATA CCACAGCAAC CGCCTACACC
ACACCCAATG GTCAAACCTA TATCTACCTA ATTGGTGGTA AAACGCGCGA TACATCGGCA
CTCTTCTCGC CAATTTATGT GCGCCGATCA GTTCGACGAA CCTTGGTAGG GCCAAATGGG
GTTTTGGGGC CATGGCAGTC AATGCCCGAT TTGCCAATTA CCCCCGATAT GTTTACCCCA
ACCAATGGCT GTGATGAGAA CGTTGGTTTG CATAGTATGG ATGTTGCCAA CTTTGATGCA
ATTACGCTTA CCAGCACCTA TCGGGCCTTT TTAGTGGTTG GTGGCACTTT CGAGTTGGGC
ACAGGCCATG TCGCTGTTGG TTGTACCCGC ACGGTCGAAG GTTCTGCTCA AGCGATGTTG
GGCAAACTCG ATACGAATGG GATGCTGACT TGGGAAACCC AGCGCTATAT CTTGCCTGAA
CCGCTTTCTA GCCCCCGCGT GATTGGGGTC AACCAAAAGA TCTATGTGGT TGGGGGCCGC
CAAGGCAGTG CTGGTGATCC CACCCATCGG ATTTATACTT CCTATATCAA TATCGATAAC
TTTACCTTAC CTGTGTTCGG CCAAAGCAAC TTCCGCGTTT CGGAAAATGC GCTTTTGACC
TCGCAAGCAC GCTCAGGTCA CGGTCTTGAG TTAATCTATA TCAACTTCCG ACCAGTTGCC
TACATGTATG GTGGGATTCG GGTTGGGAAT ACCTATCAAC AAGATGTGTT GTTTGGCTTT
GTTGGGACAG ATGCCGATAT CGACTTGACG GTTGGGGGTT ATCCTTCACC AGGGGTTTAT
CGCTCATCAC CACTCCAACT CCGTGCTCCA GCCATCATTG AGCAAATGCT GTGGGATGCG
ACGCTGCCAA ATCCACCTAT CAACACTGAT ATTCAAATGC AATATAAGCT AGCGGCAACC
CGCGGTGCAC TGGAAACTGC CCAATGGCAA ACTGTTGATG CCTCGCCTGG GAATGATAAC
TACTCGGTGC AAGGGCCGAA TGTTGCTAAT GGGACTCCCG CCGTCCAAGG CCAGTGGTTC
CAGTATCAAG CATTGATGAC CACCCAAAGT CCGACTGAAG TTGGGGCAAC TCCCATTTTA
CGCAATGTGC GGATTAAGTA TAAGGTTGAT GGTCACCCAA GCTTATACGT TGATTCGGCT
ACGATGTCAA CTGTAAGCAC AACTGGCATT ACAGCCTTTA CGGCAACCTT TAAAAATGGG
ATCAAACCTG GCTCAAATGA CACCGAAAAT GTGCTCGATG CCGATATTGA AAGCCAAGGC
ACCTTCTTTG TGGACATGTA TCTCTTGCCA CCTGGCTCTG CCGATGTGCC ACCAGCTCGT
GATCCTGATA GCGGTGCCTA CCCACTAGGG ATGGTCTTTA CCGAGATTAA TCGCTTGAAT
TTGCCCCAAG ATGGTGAATT TACGCTTGAT GCAATTTCGG ATAATACCAT TTGGCGCAGA
ACATGTCCCG CAGCCACTGT CGATTGTCCG TTAGTCGTCT GGCAGGCACT CTTCAATAAA
ACGGGGACAT GGAAGGTCTA TTTGGTGATT GATAGTGGTA ATTATGTAAC CGAGGCTGAA
ACACCAGCCG GGCAACGTGA GTTGGATAAC GTTTATTCGT TCAATGTTAA CTCAACGGTT
GTTGGGAGCA CAATTCACAT GCCGGTGGTC GGGATTAACT TCTTGGCTAC GCCACCGCAA
CCATAA
 
Protein sequence
MVRNRKRRRV LSSFLITLTL LVTFISSVSA ISLNATVSLS SGDFASGYFG LTGLTQQDQV 
IGDVTYGGVQ LIPQGALAQW SDASNQLCRT LADMGTVSFK QHLYAIGGST ASSGNVAVVS
EVCRATVTDT GGETTEWTEL PQTLPVPLTR MSTVVVTNTS DPTKGIMYTF GGQSASTGDI
EYTDKIYSNV INADGSLNTW QTQALTTGEK LINTTATAYT TPNGQTYIYL IGGKTRDTSA
LFSPIYVRRS VRRTLVGPNG VLGPWQSMPD LPITPDMFTP TNGCDENVGL HSMDVANFDA
ITLTSTYRAF LVVGGTFELG TGHVAVGCTR TVEGSAQAML GKLDTNGMLT WETQRYILPE
PLSSPRVIGV NQKIYVVGGR QGSAGDPTHR IYTSYINIDN FTLPVFGQSN FRVSENALLT
SQARSGHGLE LIYINFRPVA YMYGGIRVGN TYQQDVLFGF VGTDADIDLT VGGYPSPGVY
RSSPLQLRAP AIIEQMLWDA TLPNPPINTD IQMQYKLAAT RGALETAQWQ TVDASPGNDN
YSVQGPNVAN GTPAVQGQWF QYQALMTTQS PTEVGATPIL RNVRIKYKVD GHPSLYVDSA
TMSTVSTTGI TAFTATFKNG IKPGSNDTEN VLDADIESQG TFFVDMYLLP PGSADVPPAR
DPDSGAYPLG MVFTEINRLN LPQDGEFTLD AISDNTIWRR TCPAATVDCP LVVWQALFNK
TGTWKVYLVI DSGNYVTEAE TPAGQRELDN VYSFNVNSTV VGSTIHMPVV GINFLATPPQ
P