Gene Haur_0315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0315 
Symbol 
ID5732210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp376385 
End bp377740 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content51% 
IMG OID641277439 
Producthypothetical protein 
Protein accessionYP_001543095 
Protein GI159896848 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3591] V8-like Glu-specific endopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGAC AAACTCGCCA ACTTTGGCGC TCGACGCTGA GCCTGTTAGG AACGGTCGGA 
TTAATAGCCA GTAGTTTTTC AATCACCACT AGTTCAATTT CGGCCCAGAC GAGCGGCAGT
GCCCAACCAT GTTTGAGTGG CGATCCATTC CAACTGGCCG CGCTCAAAAC CAAACTTAGC
CAGGTCAATC CACCCAGCGG CGTTCAACGA GTCAATCGCA CATTGACGGT CGAATTTATC
ACCACTGGCG TGTTGACTGA TGAAGCGCCA ACTATGCCAG TGCTCCCTAC CGAAGTTGGC
GACGATGAGC CATACGAACG CGAGCCAGTG CCTGCTTCAA CTGCCACCTT CCGCGTGTTC
AACACATTAA CTTTGAACGA ATTTCGGGTG GTGATGCAAG CCAGCACAGT CGGCACAATT
CGCGATTGCT ACGAGCGCAA CGATTTGCTC AATGGCAAAA TACCCACCGA TTACACTGGC
GATATTGATG CACCACCGCC ACCACAACGC AGCAAGCAAA CCGAAGTGTT CACACCGTTT
GGCTGGAGCA ATGGCGACGA TAATCGTGAA TTAAAAACCA ACCATACCCA ATTTCCATTA
CGCACAATCA GCCAATTTTC ACGGGTGAGC GGCAACCAAG ATTCCAACTG TACCGGGACT
TTTGTAGGGC CACGTCACTT GATTACCGCC GCCCACTGTA TCAATCGTGA AGCAACCAAT
GTTTGGTTTA CCACCAAAGT TACGCCTGGC CGGAATGGCA CGGGCACAGG CTCAGCACCG
TATAACTCAA CCGTGATCAT GCCCAATCCA CAGCCACCAG TGGAATCATG GTATTGGACC
TTTGAAGAAT GGCGCGATCC AAACCAAAAC AATCGCACTC GCTGGGACAT TGGGATGATC
GTTGTACCTG ATCGTTTGGG CGATACCACT TCATGGATGG GCGTTGCTCC ACGTACAGCG
ACCTATTTGA AGAATACAAC CAGCTATAAT CGTGGTTACC CTAACTGTAA TGGCGACGGA
GCGACTCGTG GCAATGCTCC AGCTGGCTGT CAAGTTGCTC GAATGTATGG TGATCCTGGC
AATTGTGGGG CACGTTGGTT CAAGAACCTT GATGGTGATG GTTGGTCACG GCGCTACGAT
GTGAAATGTG ATGCCAGCGC TGGTCATAGT GGCAGCCCAG TTTATCATTA TGAATACAGC
GCCCACCACG GCAAAGATAT TCCGGTTGTG TCAGCAGTGA TTATCACCGA AGAATGTTTC
ACCTGTTCGA ATCTGAATTC ATATGTCAAT ACGGTGCGCC GCGTAACGCC TTCAGTCATC
GACAACTATG TCGCCTTACG CGAAATCTTC AACTAG
 
Protein sequence
MQRQTRQLWR STLSLLGTVG LIASSFSITT SSISAQTSGS AQPCLSGDPF QLAALKTKLS 
QVNPPSGVQR VNRTLTVEFI TTGVLTDEAP TMPVLPTEVG DDEPYEREPV PASTATFRVF
NTLTLNEFRV VMQASTVGTI RDCYERNDLL NGKIPTDYTG DIDAPPPPQR SKQTEVFTPF
GWSNGDDNRE LKTNHTQFPL RTISQFSRVS GNQDSNCTGT FVGPRHLITA AHCINREATN
VWFTTKVTPG RNGTGTGSAP YNSTVIMPNP QPPVESWYWT FEEWRDPNQN NRTRWDIGMI
VVPDRLGDTT SWMGVAPRTA TYLKNTTSYN RGYPNCNGDG ATRGNAPAGC QVARMYGDPG
NCGARWFKNL DGDGWSRRYD VKCDASAGHS GSPVYHYEYS AHHGKDIPVV SAVIITEECF
TCSNLNSYVN TVRRVTPSVI DNYVALREIF N