Gene Haur_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1856 
Symbol 
ID5733745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2159191 
End bp2160930 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content53% 
IMG OID641279000 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001544627 
Protein GI159898380 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGATC ATGATCTGAT GTTGGAGGCG GGTGGCAGTT CTGCGACGAT TCAATCCAAC 
CCTGCAAGTT TTGTTGAAAT TATTCTTCAG CGGGCCAAAA CCACTCCTGA AGCGATTGCG
CTTAATTTTT ATACAGCTGA TGGCTATCGT ACCAGCCTAT CATATGGCTT ATTGGCGCAG
CGTTGCCAAG CAATTGCTGC CGTTTTGCAA CGGGTGACCA AACCAGGCGA TCGCGCTTTG
CTATTATACC CGCCTGGCTT GGGTTATGTG GAAGCATTTT TTGGCTGTTT ATTCGCTGGC
GTGATTGCCG TTCCTGCTTA TCCGCCGCGA CCCAATCGCC GCGATCAACG TTTGGAAGCA
ATTATTAGTA ATGCTCAAGC ACGGGTGATG CTGGCAGCCA GCGAGGTCGT GGCTCAGCAA
CGCGCTTTAT CGCAACAATA TCCTGGCCTT GCTCAATTGC AATGGATCGC TAGTGATCTG
GTCAATAGCC AATTGGCCAG TATGTGGCAA GCTCCCAGCA TCAATACCCA TGATTTGGCC
TTTTTGCAAT ATACCTCTGG CTCAACTTCG CAACCACGCG GTGTAATGGT CAGCCACGAA
AATCTGGTCT ATAACTCGGG CTTGATTGCC CAAAGTTTTG GCATCACGGC TGATGATCAT
GTGTTTATTT GGTTGCCGCC TTACCACGAT ATGGGCTTGA TCGGTGGGAT TATGCAGCCG
TTGTTTACTG GCTGTGAATT AAGTTTGATC GATCCCCTAA CCTTTTTGCA ACAACCGCTG
ATTTGGCTCC GCATGATCAG CGATTTAGGG GTGACGGTCA CTGGCGGGCC AAATTTTGCC
TACGATTTAT GCGTTGCCAA AGCCAAGCCC GAAGCCTTGG CAGGCGTTGA TCTAAGCCGT
TTGCGGGTGG CATTTAATGG CGCTGAGCCA ATTCGTGCAG CAACCCTTGA GCGATTTAGC
CGCACTTTTG CCCCGTTAGG CTTCAAACCT CAGGCCTTCT TGCCGTGTTA TGGCTTAGCC
GAAGCCACGC TCTTTGTCAG TGGTGCGCCG CATGCTGCCG AACCAACCAC ACTCACGGTC
GATGCTCAAG CCTTGAGCCA ACATCAAGCG CTGCCAAGTG AGCGTGGGAC ATTGTTGGTA
AGTTCGGGCA TGGTGGCCGC GCCGCAAATT GTGGCGATTG TCGATCCTGA GCAGGGCCAG
GTTTGTGCCG ATGGTTGGGT TGGCGAGGTC TGCATTCACG GGCGTAGCAT TGCCCATGGC
TATTGGGATA ACTCCGCCGC TAGCGAGGCT ACTTTTCAAT TGATCTTGCC TGATGGCAGC
GGCCCCTTCC TGCGCACGGG CGACCTTGGT TTTATCCACG AAGGGCAGTT GTATATTACT
GGGCGGCTCA AAGACCTGAT TATTATTGAT GGGCGCAATC ATTACCCCCA AGACTTGGAA
TTGAGCGTTG AATTGGCGCA TCCGGCAATC CGCCAAGGTG GTTGTGCCGC TTTTGCCGTC
GATGGCGCTG ATGGTGAGCA AATTGTGATT GTGGCCGAAA TTCGCCGCCC CAATCAAGCC
GAAGAAGCGG CTCAGGCTGT GCGCTTGGCC CTCCAGCAAC AATACGATTT AGCAATTGCC
GATTTGATGT GGGTACGGCC TGGTCAAGTG CCCAAAACCT CAAGCGGCAA GGTGCGCCGC
CGCGAATGCC GCCAACGCTA TTTGAGCCAA ACCCTCAATA GCTTGGAGGG CGAGGAATGA
 
Protein sequence
MLDHDLMLEA GGSSATIQSN PASFVEIILQ RAKTTPEAIA LNFYTADGYR TSLSYGLLAQ 
RCQAIAAVLQ RVTKPGDRAL LLYPPGLGYV EAFFGCLFAG VIAVPAYPPR PNRRDQRLEA
IISNAQARVM LAASEVVAQQ RALSQQYPGL AQLQWIASDL VNSQLASMWQ APSINTHDLA
FLQYTSGSTS QPRGVMVSHE NLVYNSGLIA QSFGITADDH VFIWLPPYHD MGLIGGIMQP
LFTGCELSLI DPLTFLQQPL IWLRMISDLG VTVTGGPNFA YDLCVAKAKP EALAGVDLSR
LRVAFNGAEP IRAATLERFS RTFAPLGFKP QAFLPCYGLA EATLFVSGAP HAAEPTTLTV
DAQALSQHQA LPSERGTLLV SSGMVAAPQI VAIVDPEQGQ VCADGWVGEV CIHGRSIAHG
YWDNSAASEA TFQLILPDGS GPFLRTGDLG FIHEGQLYIT GRLKDLIIID GRNHYPQDLE
LSVELAHPAI RQGGCAAFAV DGADGEQIVI VAEIRRPNQA EEAAQAVRLA LQQQYDLAIA
DLMWVRPGQV PKTSSGKVRR RECRQRYLSQ TLNSLEGEE