Gene Haur_4509 details

Gene Information

Locus tag: Haur_4509
Symbol:
ID: 5736360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5773696
End bp: 5774787
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 50%
IMG OID: 641281672
Product: hypothetical protein
Protein accession: YP_001547269
Protein GI: 159901022
COG category:
COG ID:
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
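
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed length matches under that reading) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5773696, 5774787)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both results agree with the Gene Length (1092 bp) and Protein Length (363 aa) fields.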


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0202625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCAAA CCATTGTTCG CTATCGTTGG CTGTTGGTCT TCGTTGGTTT TTTGGTAGCG 
GTCCCAATCG CTGCTCAAAC GGCGGAGAAA AATAAAGGGG CTGATTGGTT GCTGACCCAA
CAGCAGTCGG ATGGCAGTTT TGATTCGTAT TATCACTCTC CACTTGATCC AACGGCATGT
TCGGTGTATG CCCTCCATGC GGCGGGCTAT CAAATTGATC CAGCCACGCA ACGCTTTATT
GAGCAACAGG CTCAGAGCTA TATTGGGCAT CCGGCTAAAG CTTCGGCCAT TGTCATGGCT
CAATTGCTAA CTGGTCATGA TCCCCGTTCG GTTGGTGGGG TCGATTTGGT TGAAGCAATC
ATCAAGAGCT ACGATCCTGC AACGGGCATG TATGGTGAAA ACTTGTATGA GAATTCCTTG
GTAATGATGG CCTTGAAAGC TGCTGGCGAG GCGATTGAGC CACAGGCAAT TCAGACGATT
CTTGATCAAC AGTTGGCTGA TGGGTCGTGG AGTACTAGTA CGCAAAACAC CGCCTTACAA
ATTCAGGCCT TAGTCGCGGC TGATCAAAAG CAATCTGCGG CAATTCCGGC AGCTTTGGCC
TTCTTGCAAA CTCAGCAAGA TTTTGATCGC GGGTTTATGA ATAATCGCGA CTTTGAGCCG
ACAGCGTTTA AGGATGCAAT TTCAACGTCC TTGGCCATCC AAGCTATCTT AGCAACTGGT
GGCGACCCCA AGGCTGAACC ATGGGGCGAT AATATCAATA ATCCTATCAA TGCTTTGCGG
CGTTTGCAGC TTGCTGATGG TGGCTTTCGG CTCGATTCAT CGACCCCACA GCCTGATACG
ATGTCAACTT GTTCGTCTGT TCAGGCTGTG CTGCAAAAAA CCTATCCATT TATTGAACTT
GCCAACATTG GCATTACGCT TACCCCGACC TTAGCTCCTG CCGATGGCTC GACAGCCACT
CCCGTTCAGC AACCTGGCCT ACCTGGGGTG TTACCAGACA CTAGCTCAAG CTCAAATCTG
GCCTTGCCAA TTTTGCTTGG CGTATTGGCT GCCTGTGTAT TGGTTGGCCT ACGTTTGCGC
AAATTGGCCT AA
 
Protein sequence
MLQTIVRYRW LLVFVGFLVA VPIAAQTAEK NKGADWLLTQ QQSDGSFDSY YHSPLDPTAC 
SVYALHAAGY QIDPATQRFI EQQAQSYIGH PAKASAIVMA QLLTGHDPRS VGGVDLVEAI
IKSYDPATGM YGENLYENSL VMMALKAAGE AIEPQAIQTI LDQQLADGSW STSTQNTALQ
IQALVAADQK QSAAIPAALA FLQTQQDFDR GFMNNRDFEP TAFKDAISTS LAIQAILATG
GDPKAEPWGD NINNPINALR RLQLADGGFR LDSSTPQPDT MSTCSSVQAV LQKTYPFIEL
ANIGITLTPT LAPADGSTAT PVQQPGLPGV LPDTSSSSNL ALPILLGVLA ACVLVGLRLR
KLA