Gene Haur_3958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3958 
Symbol 
ID5735819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4965375 
End bp4967498 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content58% 
IMG OID641281108 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001546718 
Protein GI159900471 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAGG CTTTTATCCA AACGACATCG ACGAAATATC CCCAGAAAAC CGCAGTTATT 
GATGGACCCA GACGTATAAC CTACGAGCAA CTTGCCGCTT CCATCGGTTC ATTCGCCAAT
GAACTGACTG CGGCCGGGGT TACTGAAGGC GAAAGCATCG CCTTGGTGCT ACCAAATTGC
GCGGAGTTTG TCATCGGCTT TTACTCCACC CTGCATATCG GCGCGGTTGT TTTGGCACTC
AATCCTCTGT TGAAGCACAA CGAGATCAAC TACTATCTTG CCGATGCCCA GGCCAGGGTT
ATTCTCACTA CCAAGCTGTA CATGGGCATG TGCCGCGAGA TCGTCGCCGC AGCTGGCCGC
TCAATCGAAA TCATTGCCCT GGATGGTGTG CTGGAGGGTT CCCGCGCTGC CGCATCGGAA
CGCGCCGCGC CCGCGGCTGC CGACCCGCAT CGGCCAGCGT TGTTTCAGTA TTCCTCTGGT
TCGACCGGGC GCCCGAAAAA AGTGATGCGC ACCTATGGCA ATCTGTGTGC CGAGGGTGAT
AATTTTACCG CCACTGTCGG TATGACCCAC GACGATGTGA TTTTGTGCCT GGTTCCGCTT
TTTCATGCCC ACGGGCTGGG CAATTGCTTG CTGGCTGCCA CCATGGTCGG AGCAACCCTA
GTGATTTTGG AGCAGCCGAT GGACGGAAAT GCTGTTGTCG ACATGCCCTT TATCGCCCGT
TGTGCCAGAG TCTTCGAGCT GATCGAAATC GAGCGGGTTA GCGTTTTGCC CGGCGTCCCC
TATGTGTTTA GCGCACTCAG CAGTGCACAG GTGGGCTTCG AGCCTGCGCT GGGATCGCTG
CGTTTGTGTT TTTCCGCCGG CAATTTCTTG ACCAGGGATG TGTTCGACGC CTTCCTGGAC
CGCTTTGGCA TTGCCATCAA GCAACTCTAC GGCTGTACCG AGGCGGGATC GGTCACCATC
AACCTTGAGG ATGATCCGAG CCTCGCCGCC AGCGTGGGGC TGCCGATACG CAATGTTGAA
CTCCATATCT GCGATGAGCA AAAAAACCGG CTCGCGCCCG ACGCCATCGG CGAAATAGCT
TTCAAAAGCC CCATGCTGAC CAGCGGCTAT GTCGGCCTGG AAGACATCAA CCGGGACATG
TTTCGCGATG GCTTTTTCTT CACCGGGGAT CTCGGCAGGC TTGATGAGGC CGGGCGCTTG
ACCATCACCG GTCGCAAAAA AATTTTTATC GATGTCGGCG GCAGGAAGGT CGATCCCCTG
GAGATCGAGG ATGTCCTGCT AACGCATCCC CGGGTCAAGG AAGCGGTCGT CGTCGGCATC
AAGGCGCCCT ATGGCGGCGA GTTTGCCAAG GCTGTGGCCG TGCTGGACGG CGAATGCACC
CAGACGGAGC TCCTCCAGTA TTGCAAGGAC CGCTTGGCCG ACTTCAAAGT CCCGCGCATG
ATTGAATTCC GCAACGAGAT TCCAAAAAGC CCCCTCGGCA AAATTTTACG CAAAAATCTG
GTTGATGACT CAGCCGTGGC GGAGGTTGAA GCCCTTGGAT CAACCCTGAG CCAGCACATG
CGATCCACTT CGTCTAGGGA ACAGCGCCTT TCGCTGGCCA AGCAATGCGT GCGCCAGCAG
ATTGCCCGCA TCTCGGGTCT TGATGTCGCC CAGATCGGCC TTTCGAACGC CCTCAGCGAC
TTCGGGCTTG ATTCGGCGCG GGCGATTGAA TTGCAGATGT CCCTGGAGAA TCTGATGGGT
GCGGGTTTAT CGGCCACGAT GGTGTGGCAA TATCCCGATC TGGACTCGTT GAGCGGGTAT
CTGGTGGATA TTGTTGACGC GCAGACGGCG GGCGCCGATC CCGCAGCCGT GCCGGATCGG
GCGGCCGCGC CGCCGGCCGC TCGCCCCTCC GCTATCCAGG CGATCGACGA CCTTTCAGAT
GATGCCATTG AGGCGCTTTT GCGTTCACAG GTCGATGGCA TCCTCCAGCC GCAGAACACG
ACAAACGCCC ATACCATCCC CGGTTTGAAT GAGGGTGACT CGGCGGGTAT TGATCGGCTT
GCCCAACTTT CCGACGAGGA TGTCACCGAT CTGCTGCTCA AGGAATTCGC ACGACTAAGC
CGAACCGGCC AACCTGAAGC ATAG
 
Protein sequence
MSEAFIQTTS TKYPQKTAVI DGPRRITYEQ LAASIGSFAN ELTAAGVTEG ESIALVLPNC 
AEFVIGFYST LHIGAVVLAL NPLLKHNEIN YYLADAQARV ILTTKLYMGM CREIVAAAGR
SIEIIALDGV LEGSRAAASE RAAPAAADPH RPALFQYSSG STGRPKKVMR TYGNLCAEGD
NFTATVGMTH DDVILCLVPL FHAHGLGNCL LAATMVGATL VILEQPMDGN AVVDMPFIAR
CARVFELIEI ERVSVLPGVP YVFSALSSAQ VGFEPALGSL RLCFSAGNFL TRDVFDAFLD
RFGIAIKQLY GCTEAGSVTI NLEDDPSLAA SVGLPIRNVE LHICDEQKNR LAPDAIGEIA
FKSPMLTSGY VGLEDINRDM FRDGFFFTGD LGRLDEAGRL TITGRKKIFI DVGGRKVDPL
EIEDVLLTHP RVKEAVVVGI KAPYGGEFAK AVAVLDGECT QTELLQYCKD RLADFKVPRM
IEFRNEIPKS PLGKILRKNL VDDSAVAEVE ALGSTLSQHM RSTSSREQRL SLAKQCVRQQ
IARISGLDVA QIGLSNALSD FGLDSARAIE LQMSLENLMG AGLSATMVWQ YPDLDSLSGY
LVDIVDAQTA GADPAAVPDR AAAPPAARPS AIQAIDDLSD DAIEALLRSQ VDGILQPQNT
TNAHTIPGLN EGDSAGIDRL AQLSDEDVTD LLLKEFARLS RTGQPEA