Gene Haur_2853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2853 
Symbol 
ID5736890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3621044 
End bp3622129 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content44% 
IMG OID641279996 
Productpeptidase C2 calpain 
Protein accessionYP_001545619 
Protein GI159899372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0973219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCGA CCGTTGTAGA GATTAATTAT GCGCAGATGC AACAACTTAG TCAGCGCTTC 
CAACGTCAAG CTGAGGTAGT AAGCCAGTTA CAGCAACAAC TCAATCACAC TTATCAACAA
TTACAAAACG ATTGGCATGG CGATGCCGCT AAAAGCTTTT TTAATGAAAT GCAAACCAGC
ATCTTTCCAA CCTTTGGAAA ATTAAAAGAA GTTTTAGTAA CAGCGCAACA AGTCACATTA
AACGTTAACT CGATTTTACG CGAAGCTGAA ACCGAGGCTG CCAACTTGTT CCAAGGAGCA
TTTGATGGCG GGGCCGCAGC AGGCAACGGC AAAGGTATCT ATGAGGCTAA TCCTGCTGGA
AAAAGCAAAT TGGTAACCGA CCCAGAGTAT CGCAAAATTG AGCAGCCAGC GTTTGCTCAA
GATGCTGACG ATAGCGCCGA TATTGCCATT GATGATGTTA AACAAGGTCA GTTGGGTGAT
TGCTATTTGA TGGCCGGAAC CGCAGCAATT GCCAATACGC GCCCTGATAT TATTCGCAAT
GCGATTCGTG ATAATGGCGA TGGAACCTAT ACCGTGACGC TGTATCCAGA AGAAGGTGTT
TCAGGCTTTT TTGGGATGCG CTCCAAGGTA GAGGTAACTG TAACCAATGA ATTTGTTCAT
TCAAAAGGTA GCGGTGCGCT TGGTTATGCC CAATTAAGCG ACGAATTTGA AATTTGGCCA
ATGCTCGTCG AAAAGGCCTA TGCTCAACAT AAAGGCGGCT ATGCCAATAT TGTCAGTGGT
AATGCAGGCG AGTTTATGGC AATCCTGACT GGCAACGATT CATCGCATAC CGATGTAGAA
GATGTTGATT TTGCCGATCT CAAAAGCCGC TTGGACAACG GTGCTGCAAT TACTGCCGGA
ACCCCCGATT CGCTGACGAA TAAACCAGCC GGAGTCCATG CCGATCATGC TTATGTCATT
AAAAGTATTG ATCCAACCAA TAAAACCGTT ACCTTATATA ACCCTTGGGG TTATGATCAC
CCAACAATTA CCTTTGATGA ATTTAAAGCC AATTATGAAA CCGTATCAAT TAATGAAAAG
GATTAA
 
Protein sequence
MSATVVEINY AQMQQLSQRF QRQAEVVSQL QQQLNHTYQQ LQNDWHGDAA KSFFNEMQTS 
IFPTFGKLKE VLVTAQQVTL NVNSILREAE TEAANLFQGA FDGGAAAGNG KGIYEANPAG
KSKLVTDPEY RKIEQPAFAQ DADDSADIAI DDVKQGQLGD CYLMAGTAAI ANTRPDIIRN
AIRDNGDGTY TVTLYPEEGV SGFFGMRSKV EVTVTNEFVH SKGSGALGYA QLSDEFEIWP
MLVEKAYAQH KGGYANIVSG NAGEFMAILT GNDSSHTDVE DVDFADLKSR LDNGAAITAG
TPDSLTNKPA GVHADHAYVI KSIDPTNKTV TLYNPWGYDH PTITFDEFKA NYETVSINEK
D