Gene Haur_4689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4689 
Symbol 
ID5736536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5987165 
End bp5990278 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content51% 
IMG OID641281853 
Productglycosyl transferase family protein 
Protein accessionYP_001547448 
Protein GI159901201 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0744] Membrane carboxypeptidase (penicillin-binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCAAA CTCGAAATGT AATCTCACGG CGACAACGCC GCACTACACG ATTTATTCCC 
TCACGGCTTG GCAATAAGCA AGCTCCACGC CCTATGGGCC GCCGCATTGT GCTGGCCTTT
GTGGGCTTGT TGGTGGCTGG GCTCGTGCTC ATGGGCGTGG CTGGGGTGGC CATGGCCGTT
ACCTACAACG GTATTGCCGC CAACCTCAAG CCACGGCTTG ATCAAATTCA TACCTATACG
GCTTTTCAGC CATCAAAAAT TTACGATCGC AATGGCACGC TGCTATATGA ATTTGTTGGC
GAAGGTCGGC GTACACCAGT TAAACTTGAA GAAGTTTCTA AGCATTTGAT TAACGCGACG
GTTGCCGCCG AAGATGCCTC GTTCTTCGAA AACTCTGGTG TGAACTATTT CAGTATTGCG
CGGGCAACCT ATGCCAACCT TACCCAGCAA AGTGTTGGGG CTGGCGGTGC TTCAACCATT
ACCCAGCAGG TTGTGCGCTT GATCGTACTG ACCACCGAAG AGCGTCAAGA TCCCAACGTC
TATAGCCGCA AGGTCAAAGA AATTATCTTG GCCCAAGAGT TGAACACGGT TTATAGCAAA
AACGAAATTC TCGAACTGTA TCTGAACGAA ATTCCCTATG GCAACTTGTC GTATGGCATT
CAGGCCGCCG CCCAGAATTA TTTCGGGGTT GATGCCAAAG ATTTGGATAT TGCTCAATCG
TCGTTGCTCG GCGGGATTCC CCAATTGCCC ACGACCTATA ACCCCATGCC GTGGCTCGAC
GATAATTTGC TGCTCAAAGG AATTAAATTA CCCAAAGATG TTTGGATTGA TCCGCTCTAC
GATTTGAGCA ATGACATCAA AGGCGAGATT GCCCCACCCA AAGGTCGCCA AATCGAAGTG
CTGCGCCAAA TGGTCAAAAA CAATTATTTG ACTGAACGCG AAGCGCGAGC AGCAGTGGCT
AAAGATTTAC AGTTTGCCAA GCCTGAAGTC AGTTTATTAG CACCACACTT TGTCTTTTAT
GTCAAAGATT ATTTGCAACA ACGCTATGGG GCTGAGGTGG TTTCAAATGG TGGTCTCAGC
ATCACCACCA CCCTCGATTT GGAAACCCAA AATCTAGCCC AAACCATCGC CTATACTCGT
ATTCAAGAAC TTAACGCCGA TAATCGCAAT ATTCACAATG CTGCGGTGGT GGTGATGCAG
CCCAACACTG GCCAAATTTT GGGCATGGTT GGCTCGATTG GCTATGATCT TTCCGAAACC
ACCACAACCC CCGGCGAAGA GGGCAACGTG CTTGATGGCA AAGTCAATGT CACCACTGCT
TTGCGCCAAC CAGGCTCGGC TTTGAAACCA TTCACCTATC TTTCGGGGAT GGAGCAATAT
GTGGCGACCG ACGGCGCACG CGGGATCACC CCTGCCAGTG TGTTGTGGGA TGTGCCAACG
ATTTTCAACC CACGCGGGGT CAAATACGAA CCACAAAACT TCGATAATCA ATTTCATGGG
CCATTACGGG CACGCACTGC AGTTGCCAAC TCGCTGAATA TTCCAGCAGT CAAGGGCTTG
AAGGCTGCTG GCATTCCCGA AACCCTTGAT CTATTGCATC GTTTGGGCAT TTCGCCGAAT
GTTTTGGCTA ACGACCCAGG CTATTATGGC TTGGCGCTGA CCCTTGGTGG TGGCGAAGTT
ACCCCATTGG ATTTGGCGAC AGCCTACAAT ACGGTTGCCA GCGGTGGTCG CTATTTTGCG
CCAACCCCAA TTCTCAAAAT TACCGATGCT CGTGGCAAAA CCTTGGAAGA ATTCAAGCCC
ACGCCATTGG CCAACCCCGA AAGCGATGCG GTCAGCGATA CTAGCAAGTG TGTGATTCCT
GAAGGCGAGG ATTATCAATT GGGTGCGCGA GTTCCCAATG GAACCCAATG TGTTGATGGT
CGCTTGAACT ACATCATCAC CAACATGATC AGCGATAACG AAGCACGTCG CCCAATCTTC
GGTCTGAATA GCATTTTGAA GCTCTCGCAA CCATCAGCGG TCAAAACTGG GACGACCAAC
GACTTCCGCG ATGCATGGGC TTCGGGCTTC ACGCCATTCG TTACCGTTAC AGTCTGGACG
GGCAATAACA ATAACGAACA AACCGCCCAA GTTGAAAGTA CCCAAGGCGG TGGCGTGATT
TGGGCTCGCA CGATGGAAGC CATTTTTGCC AATGAACAGA TCATGAATCG CTTGGCAGGC
TTCTATGGTG GCATCGAAAA TATGCCCCAA AGCTTCGAAA AATCGTATCC CGGGGTCTAT
CGCGAGAGCA TTTGTGAAAT TCCCGGGCCA TTCGGTGGTC GCACCGACGA GCTGTTTATT
GATGGCTTGG ATGCTGGCGG TAAGTGCGAT CTCTACGAAA AAGTCTCGGT TGTGCGGCTA
ACCGTCACCG ATGCTGAGGG CAAAGAAACC ACGACCTACT GCCGCCCAGT CGAAGGTGCT
GAGTATCCCG AAGGCGCAAT TTCATCAATT TATGTTTGGA AGTTGCCCGA AAGCAATGAT
GACGAACGGA TCGATCTCAG CAAATGGAAG GGCTATACAT CGGATCGTGG CAATAGTGGC
AACGACGAAG ATGAGCCAGT GGCGCTTGAT CCGGATAAGT TGCCTGGCTG CGATACGATT
GCTCCAACCC CAACTCCAGG CACACCAACG CCTGATCCAT TGACCCCAAC CGTACCAGTG
CTGGGGCCAG GTCAAGTTTT GATGCCTAAT TTGGTTGGCT ATGGCGAAAA TCAAGCTCGC
CAACAGTTGA TGAGCTTAGG CTTTGCGCCT GATAAGATTG TGGTCGATTA TCAAGGCCGC
GATCGACTTG GGCCAGTTTT TGATCAATAT CCAGCCTATG CTGTGGTCAG TAGCCTGCCC
GGGGTTGGCT CGGTGGTTGA TCTGAATACT GTGATTATTT TGGGCATTCG CTCGCCTGAT
GGCAGCCAGC CAACCACGGC TCCACCAGTA ACAGGCCAAC CAACGACGGC TCCGCCGCCA
GTTAACCCAA CTCCGGCATT ACCATTGCCG ACGACGATTA TTATTCAGCC GTCGCCAGTT
GAGCCATCGC CTGTGCCTGA ACAACCCCAA CCAACTCCAG TGGTAACACC CTAA
 
Protein sequence
MRQTRNVISR RQRRTTRFIP SRLGNKQAPR PMGRRIVLAF VGLLVAGLVL MGVAGVAMAV 
TYNGIAANLK PRLDQIHTYT AFQPSKIYDR NGTLLYEFVG EGRRTPVKLE EVSKHLINAT
VAAEDASFFE NSGVNYFSIA RATYANLTQQ SVGAGGASTI TQQVVRLIVL TTEERQDPNV
YSRKVKEIIL AQELNTVYSK NEILELYLNE IPYGNLSYGI QAAAQNYFGV DAKDLDIAQS
SLLGGIPQLP TTYNPMPWLD DNLLLKGIKL PKDVWIDPLY DLSNDIKGEI APPKGRQIEV
LRQMVKNNYL TEREARAAVA KDLQFAKPEV SLLAPHFVFY VKDYLQQRYG AEVVSNGGLS
ITTTLDLETQ NLAQTIAYTR IQELNADNRN IHNAAVVVMQ PNTGQILGMV GSIGYDLSET
TTTPGEEGNV LDGKVNVTTA LRQPGSALKP FTYLSGMEQY VATDGARGIT PASVLWDVPT
IFNPRGVKYE PQNFDNQFHG PLRARTAVAN SLNIPAVKGL KAAGIPETLD LLHRLGISPN
VLANDPGYYG LALTLGGGEV TPLDLATAYN TVASGGRYFA PTPILKITDA RGKTLEEFKP
TPLANPESDA VSDTSKCVIP EGEDYQLGAR VPNGTQCVDG RLNYIITNMI SDNEARRPIF
GLNSILKLSQ PSAVKTGTTN DFRDAWASGF TPFVTVTVWT GNNNNEQTAQ VESTQGGGVI
WARTMEAIFA NEQIMNRLAG FYGGIENMPQ SFEKSYPGVY RESICEIPGP FGGRTDELFI
DGLDAGGKCD LYEKVSVVRL TVTDAEGKET TTYCRPVEGA EYPEGAISSI YVWKLPESND
DERIDLSKWK GYTSDRGNSG NDEDEPVALD PDKLPGCDTI APTPTPGTPT PDPLTPTVPV
LGPGQVLMPN LVGYGENQAR QQLMSLGFAP DKIVVDYQGR DRLGPVFDQY PAYAVVSSLP
GVGSVVDLNT VIILGIRSPD GSQPTTAPPV TGQPTTAPPP VNPTPALPLP TTIIIQPSPV
EPSPVPEQPQ PTPVVTP