Gene Haur_4077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4077 
Symbol 
ID5735935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5204113 
End bp5208009 
Gene Length3897 bp 
Protein Length1298 aa 
Translation table11 
GC content51% 
IMG OID641281228 
ProductTPR repeat-containing protein 
Protein accessionYP_001546837 
Protein GI159900590 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAC CAAACATCCT CACCCTCGAA TGTACCGCTG CCGCCGCTGA TTCGCCCGAT 
CTAGCCCAAT GGACGTTGCG ACTCAATCAA TTGACTGCCA GTGGCCTGTT GCCTGCGCTG
CCAACCAAAG AGCAACGCGC CGATTTTCAA TGGTATTTTG AGCGCTATTT AGATTGGCCG
TTTTTGGAGT TTCGCGAACG CGCCATGCGG GTTGAAGCCG AGTTAGCGAC AGTTGGCAAG
GCACTCTTTG CCGCAATTTT TGAATCAAGC GCCAACGCTG CCAAAATCTA TGCTGAATGG
ATGCAGGTTG ATGCTGGTGT GCCAACCTTG CAAATTTTAA GCCTGATTCC CGCCGTGCTG
AGCGTGCCAT GGGAGTTGCT GCACGATGAT TTTGGCTTTT TGAACCAACG GCGGATCAAT
CCAGTCGCGA TCTATCGCAC CGTTTCCAGC GTTCGCTCAA AAATTGTAGC CCAGCAATTT
GCCATGCCGT TGCGGGTGCT GGTGGTTGTG GCACGGCCTG ATGATCAAAA TTTTCTCGAT
CCGCGCAGCT CAGCCCAAAC CATTTTTGGC CAGTTACAGC AATTAGAAAA CGATCAAAAA
CTCAAGCCTG GCATGATCGA GCTGGAGTTT TTGCGCCCGC CAACCTACGA TCAATTGGCG
CGTCGCCTGC AAGATGCCAC AAAACCTGTA CATATATTGC ATTTCGATGG CCACGGCGGC
TTTCCCAAAA TCCAGCCAAC CAGCACGCTC TATAAATCGG CCAACGTGCC ACAAGGGGTT
TTATCGTTCG AAAAAGCCGA TTATCAGGTT GATACGGTTG AGGCTCGCCG CTTTGCCGAA
TTGTTGAACG GCGCTGGCGT GCGTTTGGTC TTGTTGGATG CCTGCCAAAC CAGCGTGATG
GATACCAGCG CCAGCGACGA TGCCGAGTAT CAACGCCAAC AGGCTTTGAG CAGCGTGGCA
ACCCAGTTGT TGACAGCGGG CGTGCCTGCG GTGGTAGCGA TGAGCGCCAG CGTGATTGTG
CCGACCACGG CGCTCTTTTT TGGCGAATTG TATGGCCTGA TCGCCGAGGG CCAATCTGTG
CCCGTGGCGT TGGAGCGAGC ACGCCAAGCC TTGCAAAGCC AGCCCGTGCG CCTATACCTC
GCCCGTAACG CCGAACAAGA ACCTGATCCA ATTAGCTTGG CCGATTGGTG GTTGCCGCAT
TTTTATCAGC AAGCAGCGAT CAATCTCACC CCAACCGGCA AGCCAGCGCG ATCCAAGCCG
ACCAAGCTTT CGGGCTTCAT CGAAAACCCA CATCAACGCC CATTTGTTGG CCGCGCCAAG
GAGCTATTGC AGCTTGAACG GGCTTTGCTC AAGGGCAAAA TTGCCCTGCT GCATGGCTTT
GGTGGCATTG GCAAAACCCG CCTTGCCAGC GAAGCCGCCG CCTGGCTGAC CCAAACCAAG
CTCTATCAAG GCGCATTGCT GCTCTCGTTT GAACATGGCG GCAACCAAAT TGCCCTGCTC
AGTGCAATTG CCCGCCATTA TGAGTTGCCC GAAACCGATT CGCACGATTT GCAGGCAGCA
TTAAAGCGCT TCAAGCCACA CCTACACAAC AAGCCATTGC TAATTATCGC TGATAATTTG
GAGAGCATTT TGCCCAACGC GGGTAATGTT GAGCTGAATG TGTTGGAAGC CCAAGAACGC
CAAGCCTTGT GGGATTGTGT GCTAGGCTTG CGGCAAGCAG GCGCAGGCAT CATTTTGACC
TGCCGCGATT ATGATTTGCG TGATAGCCGC TTGCAGCAAG GCCAATACAC CAGCGTGATT
GCCATGCGCG GCTTGGATAC CCCCTCGGCT TACAGCTTTG CCACAGCCTT GCTCGATGAT
TTGGCGATTG ATCGGCGGCG TGCGCCAAAA TTACAACTAA GCAGCCTGTT GGCGCGGCTC
GATCATCACC CCTTGGCCAT GGGCTTGACT CTACGCGCCT TGCGCGACCC AGCCTTGAGT
ATCGAGCAAC TGCTGAGCGA CTATAGCAGT GCCTTGCTGC AATACACTGA TGAAACCAGC
CCCAACCAAC GCCATAGCTC ACTCGAAGCC TCGCTTAACT ACTCGCTGCA ACGCCTCAGC
CCTGAACAGC GCGAGTGGCT GGCTAAGCTC GCACCATTTG AAGGCGGTGC AAGTGAAGAT
GATCTTTTGG TGATCACCGA GATTCCTGCC GAGCAATGGG CGCAACTCCG TGCCTCCTTA
GAGCATGCGG CATTAATTGT GCCTGTAGCA ATTCCAGGTT TCGAAGCACC ATTTTTGCGT
TTTCACCCTA CCCTCACACC CTATTTACGC CAGCAGCACC CCGCTAGCAC CGAGCTAGAG
CAACGCTTTG CCGTGCGTTA TTATGGGTTA TCAGGCTATT GTTATCAGCA AGATCGCCAA
AACCCGCAAG CAGTTCGCGC CTTGGTACGC TATGAACTGC CCAACCTACA ACGGGCAATC
CAAGGCTTAT TGCGCTTCAA GGAGATTCCT GCGGCGGTGA ATATGTCTGA TAACCTAAAT
AAGTTTTACA ATAATTTTGG CATGCAACGC GAGCGCAATC TGCTTAATCG GCAAATTGCT
CCGTATATTT CTTCTAGTGA ACAGCTTTCG GAAGCCGAAT ATTTGTATGA AAGTGAGCTT
GGTGAGGTTG AATATAGCCA AGGTCATTAT GAGTTAGCAC TTAAACGTTT TCAGCAGTTA
CTAGCACGGA TTGAGCAACA ATTACCCGAT CAACTGCTTA ACTTTCAGCA TTGCATAACC
CTTACCCACA TTGGGCGATG TTTGCAGCGA ATGGGGCAAT CTACTCAATC GCAGCGAACT
TATGAACAAG CCTTAGCGGT GATTGAGCAA TTGCTCAATC ACCGCTTGGA TGATCGAGAC
TACCTCAAAC AACAAGCGCT GCTAATAACA GCGCTTGCTG ATTGCCACAC CGATCAAGGC
CAATTTGCCC AAGCCAAAAG CTCCTATGAG CAAGCGCTAG AAATCAAAAA AACCATTGAC
GATCGGAGGG GTCAAGCGGT AAGTTTAGGC CAACTTGGGC TGTTAGCGCT ACGCCAACGT
CAGTATGCTG AGGCCACTCA AAAATACAAT GCAGCGCTAG CAATCTATCA GCAGTTGAAC
GAGCCTAGTA TCGTAGCTTC TATCTATCAT CAACTTGGCA GAGTTGCCGA AAAGCAAAAG
AACTGGACTA TAGCAGAGAA TTACTATCGC CAGAGTCTTA CAATCAAAGA AGGTTTACAC
AATAACATTG GAGTCGCAGA AACATGTAAT CAACTAGCAC TTTTGGCAAA AAGTGCTGGT
AGATCTATTG AGGCAGAGGG CTGGTTTAAA CGGGCTTTAC AAAATCCTGA TTTACCAGTG
TTAAATCGCG CCAAATGGCT CAATAATCTT GCCGATCTCC TTGCCGACCA AATTCAAGCG
GGGACTTGGT CGCAGAGCCG CTTGGCTGAA GCTCAAAACT ATGCAGAACA AGCACTAAAT
ATTCTTAAAG GGGTTGATCC CGCACAAGCT TCTTTATGGT TACCGCTTAA TATTCTTGCC
AACTTAGCCG ATCCAGCAGG CCAAGCAGCA GAGTATTGCC AGCAAGCCCG CATTGCCTAT
GCCGCCCATG CCGCCAATCG CTGGCATATC GATCAACAAT TTGGAGATTT GATCGCGGCG
ATTGTCACTG CAACTCAAGG CAATCAAGAA GTACGCACAG CAGTCGAGCA AACCTTACCG
CAATTAGAAG CAAATGGTTG GAAGATCAGC AAGGCTATTC AGCGCATCTG GGCAGGCGAG
CGCGATTGGC ATGGGTTGTG TGCCGAGCTT GATAATCAAG ATTCATTGTT GATCTTACGG
GTGCTTGAGG AGCTTGAAGC CAATTCAGAA GGCAAAAGTC AAAAGGATAA GGGGTAA
 
Protein sequence
MSQPNILTLE CTAAAADSPD LAQWTLRLNQ LTASGLLPAL PTKEQRADFQ WYFERYLDWP 
FLEFRERAMR VEAELATVGK ALFAAIFESS ANAAKIYAEW MQVDAGVPTL QILSLIPAVL
SVPWELLHDD FGFLNQRRIN PVAIYRTVSS VRSKIVAQQF AMPLRVLVVV ARPDDQNFLD
PRSSAQTIFG QLQQLENDQK LKPGMIELEF LRPPTYDQLA RRLQDATKPV HILHFDGHGG
FPKIQPTSTL YKSANVPQGV LSFEKADYQV DTVEARRFAE LLNGAGVRLV LLDACQTSVM
DTSASDDAEY QRQQALSSVA TQLLTAGVPA VVAMSASVIV PTTALFFGEL YGLIAEGQSV
PVALERARQA LQSQPVRLYL ARNAEQEPDP ISLADWWLPH FYQQAAINLT PTGKPARSKP
TKLSGFIENP HQRPFVGRAK ELLQLERALL KGKIALLHGF GGIGKTRLAS EAAAWLTQTK
LYQGALLLSF EHGGNQIALL SAIARHYELP ETDSHDLQAA LKRFKPHLHN KPLLIIADNL
ESILPNAGNV ELNVLEAQER QALWDCVLGL RQAGAGIILT CRDYDLRDSR LQQGQYTSVI
AMRGLDTPSA YSFATALLDD LAIDRRRAPK LQLSSLLARL DHHPLAMGLT LRALRDPALS
IEQLLSDYSS ALLQYTDETS PNQRHSSLEA SLNYSLQRLS PEQREWLAKL APFEGGASED
DLLVITEIPA EQWAQLRASL EHAALIVPVA IPGFEAPFLR FHPTLTPYLR QQHPASTELE
QRFAVRYYGL SGYCYQQDRQ NPQAVRALVR YELPNLQRAI QGLLRFKEIP AAVNMSDNLN
KFYNNFGMQR ERNLLNRQIA PYISSSEQLS EAEYLYESEL GEVEYSQGHY ELALKRFQQL
LARIEQQLPD QLLNFQHCIT LTHIGRCLQR MGQSTQSQRT YEQALAVIEQ LLNHRLDDRD
YLKQQALLIT ALADCHTDQG QFAQAKSSYE QALEIKKTID DRRGQAVSLG QLGLLALRQR
QYAEATQKYN AALAIYQQLN EPSIVASIYH QLGRVAEKQK NWTIAENYYR QSLTIKEGLH
NNIGVAETCN QLALLAKSAG RSIEAEGWFK RALQNPDLPV LNRAKWLNNL ADLLADQIQA
GTWSQSRLAE AQNYAEQALN ILKGVDPAQA SLWLPLNILA NLADPAGQAA EYCQQARIAY
AAHAANRWHI DQQFGDLIAA IVTATQGNQE VRTAVEQTLP QLEANGWKIS KAIQRIWAGE
RDWHGLCAEL DNQDSLLILR VLEELEANSE GKSQKDKG