Gene Haur_1613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1613 
Symbol 
ID5733515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1873480 
End bp1874544 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content51% 
IMG OID641278752 
Productsortase family protein 
Protein accessionYP_001544384 
Protein GI159898137 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3764] Sortase (surface protein transpeptidase) 
TIGRFAM ID[TIGR01076] LPXTG-site transpeptidase (sortase) family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCGAC GACTACGAGC CTATTTGGTG ATTGTTGGCC TGTTGGTAGC TACCGTAAGT 
GCTGGCTCAG CCGAAACTGC GGCGGGACAG CCGCGCTACT TTGCCGAAAC TGGCCATAGT
TTGGCCTATA ATTTTCGGCT ATTTTGGGAG CGCAATGGTG GCTTGCCAAT TTTTGGCTAT
CCAATTACTG AAGTGTTCGT TGAAAATGGT CGTCCAGTAC AATATTTTGA GCGAGCGCGG
CTGGAGTGGC ATGCAACCAT TGGCTGGACG CTAGCCGGCC ATCTGGGGCA TTGGGCAGCC
GAAGGCTCAG CTAAACATCC AGCCTTCACG CCGCGCAGCG AAGCCGCCTA TCCTGGTCAA
ATCTTCTTCC CTGAATCGGG GCATACCCTA GGTGGGCTGT TTCGCCAGTA TTGGGAGCGC
AACGGTGGGT TGCAAGCGTT TGGCTATCCG TTATCGGAAG AATTTCTCGA GCGCAATCAA
CAAGATGGCC AAATTTATAC GGTGCAATAT TTTGAGCGCA CACGCTTTGA ATATCACCCT
GAATTGCCAG CAGCTTTTCA AGTCTCGTTG GGCCATTTAG GTCGCCAATA TTTGAATGCT
ACTAAGGCTG CGCCGGAATG GGCTACCCGC AAAGTCAATA ATGCTGATGC AGCGTGGCAA
GCATTACGGC CAACTCGTAT CAGCATTCCA CGAATTGGGC TTGATAGTAC GATTGTTGAA
GCAGGTTTTT CGTTGGGAAC ATGGGACGTA CCAACCGATG CTGCGGCCCA TTATTGGCCA
GTGGCAAGTT TTCCAACAAC GGCTGGGAAT ATAGTACTAG CAGGTCATGT CGGCTATCAT
GGTATTATCT TCAGTCAGTT ACCGAATGCA GTCGTCGGCG ATCGCTTGAT CCTGACTGTT
GATGGGGTAG AACACCGCTA CCAAGTAACT GACATAAGTA CTGTGACCCC CGACCAAACA
TGGGTAATGG AGCCAACCGC TGAAGAAACG GTGACGCTAA TTACCTGTGT GCCGATCGGT
GTGTATTCGC ATCGCCTAAT TGTGCGTGCG AAGCCCCAAC CGTAG
 
Protein sequence
MLRRLRAYLV IVGLLVATVS AGSAETAAGQ PRYFAETGHS LAYNFRLFWE RNGGLPIFGY 
PITEVFVENG RPVQYFERAR LEWHATIGWT LAGHLGHWAA EGSAKHPAFT PRSEAAYPGQ
IFFPESGHTL GGLFRQYWER NGGLQAFGYP LSEEFLERNQ QDGQIYTVQY FERTRFEYHP
ELPAAFQVSL GHLGRQYLNA TKAAPEWATR KVNNADAAWQ ALRPTRISIP RIGLDSTIVE
AGFSLGTWDV PTDAAAHYWP VASFPTTAGN IVLAGHVGYH GIIFSQLPNA VVGDRLILTV
DGVEHRYQVT DISTVTPDQT WVMEPTAEET VTLITCVPIG VYSHRLIVRA KPQP