Gene Haur_5119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5119 
Symbol 
ID5737077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp159319 
End bp160482 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content63% 
IMG OID641282284 
ProductNLP/P60 protein 
Protein accessionYP_001547875 
Protein GI159901629 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.553866 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACCC GCTTCCGACT CGTCGGACTT GTCATAATCG TTTTCTTGAG TGGTTGTGGA 
GGTCGCCCGC TTCCGGCGGC TTCCACCCAT CCCATCACGG GCAAGCCGCA GTGGTGGTGT
CCAACTCCGG TCATGGCAGG CGATCCGACC GCGATTGCCG CCATGCCGAC GGTGACACCC
TACTATCACC GTGACCAGTT CATGCTTGGC CAAGACGTGC TCAGCAATGG GTTGCGCGTG
ACCGTGCATG GCATCACCAG CGGCGAGGAA GCGCCTGAAG CCATTGGCGG GGGCCAGGTG
CAGTGGGTCG ATCTCGAACT CACCAGCGCG GTGTCGTTGC CGCTTGATCT GGCGGCGCAG
GTCGTCATTC GTGAGGTCGA GCAGGAAGCA GGACAAGCGG CACGCGGCTG GTGGACAACC
GACACCGCGA CGCTGGCCAC GACCGCGATT ACCCTGCCCA CGCGGCTGGA AGCAGGCATT
TCATGGCGCG GGTCGATTCC GATTCGTACC CCGATTGGTA CGCCCGTGTT TGTCCTGATC
TACCGCACGC CTGCCGATGC GCTGCTGCGG GAGCAGCCGA CCGATGGCGT AATCGTGGTG
CAGAATCGCC GCGACCCGAC CTGTGCGGGC AATATCGCGC GGGTTCCCTT CCCGACCATG
CCCGCAGGCG GTGGATCAGG TGCGCCGATT AACGGGACAC CCATCGCCGT TCCACCAGGC
ACGAATCCAC TCGTGGCTTA CGCGGTCAGC AAGCTGGCAT GGCCCTATGT CTGGGGTGGC
GAAAGCGAGG CCGAAGGCGG CTTTGACTGT TCAGGGCTGA TGTACGCCGC CTATGGCAGT
GTCGGCCTGA CGATTCCGCG CACCTCACAG GCGATGTGGC AGAGCGCCCA GCTGCAACGG
ATTGGCATCA GCGAGCTGCG ACCGGGTGAT CTGGTCTTTT TCCACACCGA TAGCAGCCGC
TTTAGCAGCC CGCCAACCCA TGTGGGGATG TATATCGGTG ATCTGAATGG CAATGGCACA
CCCGATTTAG TCCATGCCCT CAGTCCGGCG TGGGGTATTC GGATTGAGGA TAACTGGCTC
ACCAAGCCGT GGCTGGTGGC CCCATGGCCC GATGGCACGC CCCGTTTGTG GGGGGCTGGC
TACTTTGTGA ATCCGTATCG GTAG
 
Protein sequence
MSTRFRLVGL VIIVFLSGCG GRPLPAASTH PITGKPQWWC PTPVMAGDPT AIAAMPTVTP 
YYHRDQFMLG QDVLSNGLRV TVHGITSGEE APEAIGGGQV QWVDLELTSA VSLPLDLAAQ
VVIREVEQEA GQAARGWWTT DTATLATTAI TLPTRLEAGI SWRGSIPIRT PIGTPVFVLI
YRTPADALLR EQPTDGVIVV QNRRDPTCAG NIARVPFPTM PAGGGSGAPI NGTPIAVPPG
TNPLVAYAVS KLAWPYVWGG ESEAEGGFDC SGLMYAAYGS VGLTIPRTSQ AMWQSAQLQR
IGISELRPGD LVFFHTDSSR FSSPPTHVGM YIGDLNGNGT PDLVHALSPA WGIRIEDNWL
TKPWLVAPWP DGTPRLWGAG YFVNPYR