Gene Haur_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2040 
Symbol 
ID5733929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2543870 
End bp2546644 
Gene Length2775 bp 
Protein Length924 aa 
Translation table11 
GC content59% 
IMG OID641279184 
ProductTPR repeat-containing protein 
Protein accessionYP_001544811 
Protein GI159898564 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.962873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAACG CCCTTCCGGA CGCTGTTTTG GCCAAACTCC ATGCCGTGTT TCCCGACCTT 
GCCGCTGCCG ACCTGCATGC CATCCTTGAA CGGGTGCTCA GCGGTGAGCG GGTCGCCATT
GCCGATAAAG TCCTAAGTGT GCAGACGGAT CGCCATGGAC AGGCGATTGC GTTGGGCTGT
ACCGTGCTTG GGGATTTAAC CCAAGTGGTG TTCCAGATCC ACCTCCCCGA ACCCATTGAC
CTGCTGCCTG CGGCCCTTGA GCGCTTGGCG ACCCTCCCGC TCGAGACGAT TCCTGAGCCA
CGGATGGATC TGCCTGCGCG GTCGCACCTT GATTTCCGCC CAAATCCTAA TTTTGTTGGG
CGGGCCGCCG CTTTTCGTGA TCTGGCAGCA GCCCTCAAAC GGCATCAAAC CGCTATTTTA
ACGCCAGCGG TGGCCGTGGG TGTGGGCGGG ATTGGGAAAA CGAGCCTTGC GGTGGAGTTT
GTGTATCGGT ATGGCTGGTA TTTTGCGGGC GGGATTTTCT GGATTAATAG TGCTGATCCG
ACGCAGATCG CCAGTCATGT GGCGGCCTGC GCTCCGGCCT TGGGGATTGA TCCCCGGGGG
ATGCCGCTTG ATGAGCAAGT GCACCAGGTC CTGCTTGCAT GGCAAGCAGC GATGCCACGC
TTGATCGTCG TCGATAATTG TGATGATGCG AAGGTCATTG ATGGATGGCT ACCCGCGATT
GGGGGCTGTC GGGTGCTCAT AACCTCGCGA TCCGATCAAT GGGCCAGTGT CCCACTGGTT
CGGGTGGGAT TGCTCTCCCC GCGTGAGAGT CGCGCATTGG TACAGCGCCT CTGTGCACGG
TTGACCGACA TCGAGGCCGA TGCGATTGCG GAAGATGTCG GCTATTTACC ACTGGCGTTG
CATCTCGCTG GGAGTTATCT CACCGCCTAT TCCCATCATA CGGTCGAACA ATACCGTAAG
GATTTGACGA TTGCTCATCG CTCGCTCAAG GGACGAGGAG CGTTGCGTTC ACCGACACGG
CATGAACAGG ATATTGAAGC CACCTTTATG CTGAGTTTTA ACCAACTTGA TCCGACCAAT
GCCCTTGATG CGTTGGCCTT AGGTATGCTT GATGGGGCGG CGTGGTGTGC GCCAGGCGTG
CCAATCCCGC GCGATCTGGT ACTGGCATTC GTGCCGGATG AGACGGATGC TGATGATGCG
GTTGATGCGT TACGGCGTTT GCAGCAACTC GGCTTGCTGG ATGGAGCAGA TGCTGTGGTG
TTGCATCGCT TGTTGGCCCA AGTCGTCGAG GCACGATTAG GATCGACGGA AACGCTAGCC
ATGGTGGAAG ATCGGATTGA TGCTGTGGCA TCCCGTGTTA ATGAGATGGG TGTGCCACGT
TCGATGCTGT CGCTTGAGCC ACATCTACGC CACACCACCA CACGGGCGCT GAAGCGTGGT
GATGCCCTAG CCGCACTTCT TGCCAATAAC CTAGGGTATT TTGAAGCCCT GCGGGGAGCC
TATGCCGACG CACAGCCCTT GTATGAACGG GCCTTGGCCA TCAGGGAAGC GGTGTTGAGG
GCCGATCATC CCGATACGGC ACAGAGTGTG AACAATCTGG CCTCGGTCTT GTTGCATCAA
GGGCGGTATG CCGACGCACA GTCCTTGTTT GAACGGGCCT TGGCGGTGCG GGAAACGGTG
TTGGGGGCCG ATCATCCCGA TACGGCGACG AGCGTGAACA ATCTGGCGTT TGTCTTGGAG
CGTCAAGGGC GGTATGCCGA CGCACAGCCC TTGTTTGAAC GGGCCTTGGC CATCAGGGAA
GCGGTGTTGG GGGCCGATCA TCCAGCGACG GCGGTGAGTG TGAACAATCT GGCGGGGGTA
TTGTTGCGTC AAGGGCGGTA TGCCGACGCA CAGCCCTTGT TTGAACGGGC CTTGGCCATC
AGGGAAGCGG TGTTGGGGGC CGATCATCCA GCGACGGCGG TGAGTGTGAA CAATCTGGCG
GGGGTATTGT TGCGTCAAGG GCGGTATGCC GACGCACAGC CCTTGTTTGA ACGGGCCTTG
GCGATTTATG AAGCGGTGTT GGGGGCCGAT CATCCCGATA CGGCGGTGAG TGTGAACAAT
CTGGCGGGGG TATTGGATCG GCAAGGGCGG TATGCCGACG CACAGCCCTT GTATGAACGG
GCCTTGGCGA TTTATGAAGC GGTGTTGGGG GCCGATCATC CCGATACGGC GGTGAGTGTG
AACAATCTGG CGGGGGTATT GGATCGGCAA GGGCGGTATG CCGACGCACA GCCCTTGTAT
GAACGGGCCT TGGCCATCAG GGAAGCGGTG TTGGGGGCCG ATCATCCCGA TACGGCACAG
AGTGTGAACA ATCTGGCGAT GGTCTTGGCG AGTCAAGGGC GGTATGCCGA CGCACAGCCC
TTGCATGAAC GAGCCTTGGG CATCTATGAA GCGGTGTTGG GGGCCGATCA TCCAGCGACG
GCGGTGAGTG TGAACAATCT GGCGGGGGGC TTGGCGAGGC AAGGGCGGTA TGCCGACGCA
CAGCCCTTGT TTGAACGGGC CTTGGCGATT TATGAAGCGG TGTTTGGGGC CGATCATCCC
GATACGGCGG TGAGTGTGAA CAATCTGGCG GAGGTCTTGG AGCGACAAGG ACGGTATGCC
GACGCACAGC CCTTGTTTGA ACGGGCCTTG GCGATTTATG AAGCGGTGTT GGGGGCCGAT
CATCCCGATA CCCAGTTTAT TCGTGCGAAT CTCGTCTCTT TACAAGCCAC CATGACGGCA
GACGACCATC TGTAG
 
Protein sequence
MTNALPDAVL AKLHAVFPDL AAADLHAILE RVLSGERVAI ADKVLSVQTD RHGQAIALGC 
TVLGDLTQVV FQIHLPEPID LLPAALERLA TLPLETIPEP RMDLPARSHL DFRPNPNFVG
RAAAFRDLAA ALKRHQTAIL TPAVAVGVGG IGKTSLAVEF VYRYGWYFAG GIFWINSADP
TQIASHVAAC APALGIDPRG MPLDEQVHQV LLAWQAAMPR LIVVDNCDDA KVIDGWLPAI
GGCRVLITSR SDQWASVPLV RVGLLSPRES RALVQRLCAR LTDIEADAIA EDVGYLPLAL
HLAGSYLTAY SHHTVEQYRK DLTIAHRSLK GRGALRSPTR HEQDIEATFM LSFNQLDPTN
ALDALALGML DGAAWCAPGV PIPRDLVLAF VPDETDADDA VDALRRLQQL GLLDGADAVV
LHRLLAQVVE ARLGSTETLA MVEDRIDAVA SRVNEMGVPR SMLSLEPHLR HTTTRALKRG
DALAALLANN LGYFEALRGA YADAQPLYER ALAIREAVLR ADHPDTAQSV NNLASVLLHQ
GRYADAQSLF ERALAVRETV LGADHPDTAT SVNNLAFVLE RQGRYADAQP LFERALAIRE
AVLGADHPAT AVSVNNLAGV LLRQGRYADA QPLFERALAI REAVLGADHP ATAVSVNNLA
GVLLRQGRYA DAQPLFERAL AIYEAVLGAD HPDTAVSVNN LAGVLDRQGR YADAQPLYER
ALAIYEAVLG ADHPDTAVSV NNLAGVLDRQ GRYADAQPLY ERALAIREAV LGADHPDTAQ
SVNNLAMVLA SQGRYADAQP LHERALGIYE AVLGADHPAT AVSVNNLAGG LARQGRYADA
QPLFERALAI YEAVFGADHP DTAVSVNNLA EVLERQGRYA DAQPLFERAL AIYEAVLGAD
HPDTQFIRAN LVSLQATMTA DDHL