Gene Haur_3621 details

Gene Information

Locus tag: Haur_3621
Symbol:
ID: 5735482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4551188
End bp: 4552354
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 54%
IMG OID: 641280770
Product: L-carnitine dehydratase/bile acid-inducible protein F
Protein accession: YP_001546385
Protein GI: 159900138
COG category: [C] Energy production and conversion
COG ID: [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID:
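
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete CDS whose single terminal stop codon is not counted in the protein length; the function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4551188, 4552354)
print(length_bp)                  # 1167
print(protein_length(length_bp))  # 388
```

Both values agree with the Gene Length and Protein Length fields of this record.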


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAGTT CTCCGCTCGA TGGCCTGCTC GTGCTCGACC TCTCACGGGT GCTGGCAGGC 
CCATATTGCA CCATGATTCT TGGCGATCTT GGGGCTGATG TGATTAAAGT TGAAGCGCCG
CAAGGCGATG ATACCCGCCG CTGGGGGCCG CCGTTTGTTG CTGGCGAGAG CGCCTACTAT
TTGGCGGTCA ATCGCAATAA ACGCTCATTG GTCGCCGATC TCAAAACTGC CACTGATGCT
GCGTTGATTC GCACAATCGC CAACCACTCC GATATTGTGG TTGAGAACTT TCGGCCTGGC
ACGCTCGAAC GCTTTGGCTT GAGTTTAGCC AGCCTGCGCG AGCAAAATCC CCGTTTGATC
ACGCTCACAA TCAGCGGAAT GGGCGCGACT GGGCCTGAAG CTGAGCTGCC AGGCTATGAT
TTTATTGTGC AAGCGACTGC AGGTTTGATG AGCATTACCG GACCCGCCGA GGGCGAGGCC
AGCAAAGTTG GAGTTGCTGT CGTCGATTTG ACGACTGGAA TGTTGGCGAG CAACGCGATT
TTGGCGGCGT TGTATGCCCG AGAACGCACA GGTTTGGGGC AGCATATTGA TATTTCGCTG
TTAGAAGCCC AAGTTGCTTG GTTGGCGAAT GTTGGCAGTG CCTATTTGGT AACTGGCGAG
GAGCCAAACC GCTATGGTAA TGCTCACCCT ACGATTGTGC CCTATCAAAC CTTTGAGGCC
AGCGATGGCG CGTTTGCCTT GGCCGTCGGC AATGATCGCC AATGGCAACA GTTATGTGCT
GTTTTGCAAC AACCAGCTTG GGCGGAAGAT CCACGCTTTA GCACCAACCC GCAACGAGTG
CAGTACCGCA GCGCATTGGT TGATTTGCTG AATCACCAGT TCGGCCAAAA ACCAAGCGCC
GAGTGGGTGG CGTTAATTCA GGCGGCGGGC ATTCCGGTGG GCGCAGTGCG CAGCGTTGGT
CAAGTGTTGA ACGACCCGCA GGTGTTGGCA CGTGAAATGG TGGTCAGTGT GCCACACCCA
ACGATTGGTG ATTTAAAAGT CTTGGGAATT CCGTTTAAAT TCTCGGCAAC ACCCGCGAGT
ATTCGTTTGG CCCCACCACT ACTGGGACAA CATAGCCAAG AAATTCACCA ACAATTTAAC
CCAAAAGTAC CATCTAAAAC CGACTGA
 
Protein sequence
MLSSPLDGLL VLDLSRVLAG PYCTMILGDL GADVIKVEAP QGDDTRRWGP PFVAGESAYY 
LAVNRNKRSL VADLKTATDA ALIRTIANHS DIVVENFRPG TLERFGLSLA SLREQNPRLI
TLTISGMGAT GPEAELPGYD FIVQATAGLM SITGPAEGEA SKVGVAVVDL TTGMLASNAI
LAALYARERT GLGQHIDISL LEAQVAWLAN VGSAYLVTGE EPNRYGNAHP TIVPYQTFEA
SDGAFALAVG NDRQWQQLCA VLQQPAWAED PRFSTNPQRV QYRSALVDLL NHQFGQKPSA
EWVALIQAAG IPVGAVRSVG QVLNDPQVLA REMVVSVPHP TIGDLKVLGI PFKFSATPAS
IRLAPPLLGQ HSQEIHQQFN PKVPSKTD
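
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a listing could be cleaned up and checked against the record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The sketch assumes the input contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignment, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence in this record.
print(translate("ATGCTGAGTT CTCCGCTCGA TGGCCTGCTC"))  # MLSSPLDGLL
print(translate("CCAAAAGTAC CATCTAAAAC CGACTGA"))     # PKVPSKTD
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.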