Gene Haur_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1556 
Symbol 
ID5733443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1806742 
End bp1807965 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content51% 
IMG OID641278695 
Producthypothetical protein 
Protein accessionYP_001544327 
Protein GI159898080 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCTTA ATCAAGCGCC CCAACTCGAT TTAACCTATT GCACCAATAT TCATCCGGCC 
AACGGCTGGC CAGCGGTTTT AGCTGGTTTG CAACAGCATG TGCTCGATCT CAAACAGCGC
CTCGCTCCCA ACCAAGCCTT TGGGATTGGC CTGCGGCTTT CGGGCCAAGA AAGCCACCAA
TTGTTGGAGC CAACTGCGCT TGCTGATTTT CAAGCATGGC TAACTGAACA TAATCTTTAT
GTCTTTACCT TGAATGGCTT TCCCTACCAT CCCTTCCACC AACAACCAGT CAAGGATCAG
GTGCATGCGC CCGATTGGCG TGAGCCTGAA CGAGTGGCCT ATACCTTACG GCTGATTGAG
ATCTTGGCGG CACTGTTGCC CAAGGGCATG GTTGGCTCAA TTTCGACCAG CCCTTTGAGC
TATAAACCAT GGTTTGCCGA TTTGTCCGCC GTGCCGTGGG CATTGTTGAA TCGCCATGTG
TTGCAGGTGG TCGCCGCGTT GGTCCAGCTT GAGCGCCAAC GTGGGATTGT GATTCAATTA
GCTTTCGAGC CAGAGCCAGA TGGTTTGCTC GAAACCAGCA GTGAATTAAT CGGCTATGTT
GAGCAATTGT TGGATGTTGG CGCTGTTGAA TTAGCAGCTC AACTTGATTG CTCGTTGCGC
GAAGCCCAAA ATGCGATTCG TCGCCATGTC GGAGCCTGTT TGGATACCTG TCATTGCGCC
GTAGCCTACG AAGCGCCGCG CCACGTGATC GCTGCTTATC AAACGGCAGG CATCAGCATT
GCCAAAGTGC AACTTAGCTC AGCCTTGCAA GTGATGCTTG ATGACGATCG CCAGGCCGTA
GCAGCAGCTT TAGCACCATT CAGCGAAGCA ATTTATTTGC ACCAAGTGAT TCAGCGCAAC
CATGATGGTT CGTTGCAGCA ATATCGCGAT TTGCCTCAAG CCTTGGAAAA GATCGATGAT
CCTGCTGCTT GCGAATGGCG GATTCATTTT CATGTACCGA TTTTTACCGC CAGTTTTGGC
CTGCTCAACG CCACCCAACC AGCCTTGCTC GAAAGTTTGC AAGCCTTGCT CGAAAGTTTG
CAAGCCTTGA ACGAGCAGCC CTACAGCCAG CATTTGGAGA TTGAAACCTA TACGTGGGAT
GTGCTGCCAA GCCAATTAAA GCTCGATCTG ACTGAATCGA TCGCGCGGGA GTATGCGTGG
GTGTTGCATG AACTCAAACG CTAA
 
Protein sequence
MQLNQAPQLD LTYCTNIHPA NGWPAVLAGL QQHVLDLKQR LAPNQAFGIG LRLSGQESHQ 
LLEPTALADF QAWLTEHNLY VFTLNGFPYH PFHQQPVKDQ VHAPDWREPE RVAYTLRLIE
ILAALLPKGM VGSISTSPLS YKPWFADLSA VPWALLNRHV LQVVAALVQL ERQRGIVIQL
AFEPEPDGLL ETSSELIGYV EQLLDVGAVE LAAQLDCSLR EAQNAIRRHV GACLDTCHCA
VAYEAPRHVI AAYQTAGISI AKVQLSSALQ VMLDDDRQAV AAALAPFSEA IYLHQVIQRN
HDGSLQQYRD LPQALEKIDD PAACEWRIHF HVPIFTASFG LLNATQPALL ESLQALLESL
QALNEQPYSQ HLEIETYTWD VLPSQLKLDL TESIAREYAW VLHELKR