Gene Haur_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2023 
Symbol 
ID5733912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2514968 
End bp2516194 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content51% 
IMG OID641279167 
Producthypothetical protein 
Protein accessionYP_001544794 
Protein GI159898547 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCT TAGTGCTCAA TCGTCAAAAG CCTCACTTAG CTCCGTTTGG CGATTGGCTT 
GGCGATTTAG TGCCACAGGC CCGCTTATTT ACGGCTGCCA ACCGTGTGCA GGGCTTTCAA
GGGTTTGCGG CGATTCAGCC ATTTGAGAAC TATGAAGACA GTGGCCTGAT TGAATTTGAG
GCTTTACGGC TGCATCGTCA ATCGCCAATC GAGCGAATTG TTGCAACTTC AGAGGTCGAT
ATTCTGCGTG CAGGCCGCTT ACGTAGCTAT CTTGGGTTGC CAGGCCAACA AGCCGATAGT
GCCTTGGCCT TTCGCAATAA AGTTGTGATG AAGCAACACC TGGTTAATCG CACTCAGCTG
GTCAATATCC CAATCTTTCA GGCGATCAAC GAGCCGTTCG ATATCATTCA ATTTATCGAA
CAGCATGGCT ACCCAGTAAT CGTCAAACCA GATGATGGCA GTGGCTCGCT GGGGGCAAAA
ATGCTGGCAA ACGAGGATGA TCTGGCCCAG TTTTTACAAC AGCCGCTGCC CCGTGGTTTA
GAAATTGAGT GCTTTATCCA AGGCGATCAA TATCATGTCG ATGGATTATT GGTCGATAAC
GAGGTCTGTT TCTGCTGGCC ATCGCAATAT CTTGGCAATG GTTTATCCTT TACCCAAGGC
TGGTTTACTG CGAGCCAGAT GCTTCGGCCC GAACATCACT TGACCCAGCG CTTAATCGCA
GCGGCCAAAG AAGTGTTGGC TTTGCTGCCA ACTCCACCCG TCACCAGCTT TCACCTTGAG
TTGTTTCATA CTCCTGGCGA TGAGCTGTTC TTTTGCGAAA TTGCCAGCCG CACTGGTGGC
GGTATGATCA ACGGAACAAT TGAGCAGGCA TTTGGGATTA ATCTAAATCA ACTCTTTATC
CAAGGCCAAG CTGGCATGCC GATTGATACG AGCCGATTAA GGGCGATCAC CCAACCCAAG
AAGATTGTCG GTTGGGGGTT GGTTCCACCG CAAGCTGGGG TTTTTCGCGG CTATCGCCAA
GCAAAACCAC CCCAACCATG GGTCCTCCAC TTCGATTGGA GCATTCAGGC AGGTACGCAC
TCACAACCAG CGCAAATGAG TGTTGATCAA GTCGGCGGGT TTATTGTTGA TCTGACTGAT
GCTCCTAACC CCGAAGAACG CTTGATTGAG GTTTGGCGCT GGGCCGAGCA CCAAGCACTG
TGGGAGCCAG CAGGAGTAAA TGCATGA
 
Protein sequence
MKILVLNRQK PHLAPFGDWL GDLVPQARLF TAANRVQGFQ GFAAIQPFEN YEDSGLIEFE 
ALRLHRQSPI ERIVATSEVD ILRAGRLRSY LGLPGQQADS ALAFRNKVVM KQHLVNRTQL
VNIPIFQAIN EPFDIIQFIE QHGYPVIVKP DDGSGSLGAK MLANEDDLAQ FLQQPLPRGL
EIECFIQGDQ YHVDGLLVDN EVCFCWPSQY LGNGLSFTQG WFTASQMLRP EHHLTQRLIA
AAKEVLALLP TPPVTSFHLE LFHTPGDELF FCEIASRTGG GMINGTIEQA FGINLNQLFI
QGQAGMPIDT SRLRAITQPK KIVGWGLVPP QAGVFRGYRQ AKPPQPWVLH FDWSIQAGTH
SQPAQMSVDQ VGGFIVDLTD APNPEERLIE VWRWAEHQAL WEPAGVNA