Gene Tery_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4067 
Symbol 
ID4242095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6279994 
End bp6281409 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content31% 
IMG OID638108970 
ProductO-succinylbenzoic acid--CoA ligase 
Protein accessionYP_723551 
Protein GI113477490 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.846169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000722409 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAACCTAA TCTTTAACTA CCTAAAAAAA TACCAAGTCA AAAGAAGTAA AACTTTTTTG 
AGCTCTAGCG AATTTTCTTA TCTTACGAAC AAAAAAATTA AAAACTTAAT CAAAATTGCA
AATCCAAGAA CTCCTAGCAA AGTATTAATA TCAGAATCAA ATCCAACAGA ATTTATTTCT
AGTTTTCTTG CTGCTGTTGG TGCTAATTGT CAAGTATTTT TATGTAATCC AAAATGGGGA
CAACTAGAAT GGGAAAAAGT TTTAAAATTA GTAGAACCAG ACATGATTTT GGGAAATATT
CTTAACCATA AATCTCTCGA AAAGTCTTTA GAAAAAATAT CTAGTTTGTC TAGAGATAAC
CCCTGTAAAA AAACTCTGAC TACTGAGGAA AATTTGATTA TGATTCCTAC TGGTGGCTCA
TCTGGCAAAA TTAAATTTGC TATTCATACA TGGGAAACTT TAATGTCATC TGTAAGAGGT
TTCCAAGGAT ATTTCCAAGT ACAAGAAATT AATTCATTTT GTGTTTTACC CCTATATCAT
GTCAGTGGTT TAATGCAGTT TATACGCTCC TTTACTACTG GGGGAAATTT AATAATTTTG
CCATCCTATA AAGATATTTT AGAACAAAAA GAATGGAATA TTAACCCCAA TGAATTTTTC
ATTTCCCTAG TACCAACTCA GTTACACCAT TTGTTACAAA AGGCAGAAAC AGCTAACTGG
TTATCTAATT TTAAAATAGT GCTTTTAGGT GGTTCCGCAG CTTGGGAAGA ATTATTTGAT
GCTGCAAGAA AATATCAAAT TAAATTAGCT CCAACTTATG GAATGACAGA AACTGCTTCT
CAAGTTGCGA CTCTTAAACC ACAAGATTTT TTGGCAGGAA ATAATAGTAA CGGTCAAGTA
TTACCTCACG CTAAAATTAT TGTGAAAAAT GAAAGTGGGA AAATATTATA TCAAAATCAA
ATTGGTAATA TTAGCATTAA AGCTAATTCT TTGGCGTTAG GGTATTATCC TGATATATTT
AATAATTATG AAAGTCTAGT AACAGATGAT TTAGGATTTT TTGATCATCA AGGTTACTTA
AAAATAGTAG GTCGTAGTAG TCAAAAAATT ATTACTGGTG GGGAAAATGT TTTTCCGGCA
GAAGTTGAAG CTGCTATTTT GACAACTGGT TTAGTTGATG ATATTTGTGT AATTGGCTTA
GCAGATAAAT ATTGGGGTGA AGTTGTAACT GCTGTTTATG TGGGTAATTA TTTTGAAGTT
TCTAAGGAAA AGTTGTTAGC TGCTATTGAT AAAAAATTGA GCAAATTTAA GCAGCCTAAA
TATTGGCTAA GAGTAGAAAG TTTACCTCGT AATTCTCAAG GAAAAATTAA TCGAGAGTTT
TTAAAAGAAA TTGCTATTCA AAGAATAGGA GAATAG
 
Protein sequence
MNLIFNYLKK YQVKRSKTFL SSSEFSYLTN KKIKNLIKIA NPRTPSKVLI SESNPTEFIS 
SFLAAVGANC QVFLCNPKWG QLEWEKVLKL VEPDMILGNI LNHKSLEKSL EKISSLSRDN
PCKKTLTTEE NLIMIPTGGS SGKIKFAIHT WETLMSSVRG FQGYFQVQEI NSFCVLPLYH
VSGLMQFIRS FTTGGNLIIL PSYKDILEQK EWNINPNEFF ISLVPTQLHH LLQKAETANW
LSNFKIVLLG GSAAWEELFD AARKYQIKLA PTYGMTETAS QVATLKPQDF LAGNNSNGQV
LPHAKIIVKN ESGKILYQNQ IGNISIKANS LALGYYPDIF NNYESLVTDD LGFFDHQGYL
KIVGRSSQKI ITGGENVFPA EVEAAILTTG LVDDICVIGL ADKYWGEVVT AVYVGNYFEV
SKEKLLAAID KKLSKFKQPK YWLRVESLPR NSQGKINREF LKEIAIQRIG E