Gene Tery_3693 details


Gene Information

Locus tag: Tery_3693
Symbol:
ID: 4243868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5670688
End bp: 5672076
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 38%
IMG OID: 638108640
Product: cytochrome P450
Protein accession: YP_723227
Protein GI: 113477166
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
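
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5670688
end_bp = 5672076

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1389
print(protein_length)  # 462
```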


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.429017
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.314695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTAC CTGATGGCCC AAGTCTGTCA CCATTACAAC GAAGACTCCG GACATGGAAA 
TTTATTTTTA GTCCCTTATC TGCCATAGAA GAGCGATACT CTGAATATGG AGATATCTTT
AGAACGAATA CTAACTCCTT GTATCCTTTC ATCTACTTCT GCAATCCTAA AGCCATTCAA
CAAATTTTTA CCGCGGATCC TGATACTTTT ACCTCAGGAA GTATAAATGG TATTTTAAAA
TATTTTGTGG GCCTAAATTC TCTATTGCTC CAAGATGGCG ATCGCCACAA ACGACAAAGA
AAACTATTAA TGCCACCTTT TCATGGTGAT CGGATGCGTA AATATGGAGA CCTAATCTAT
AACATCACTT CTAATGTTAT TAGTCAGTGG AAAATAGAAC AACCTTTTCC TATTCGCAAG
TCAACTCAAG AAATATCTCT CAAAGTAATT CTTGCTGCTG TATTTGGTTT AGATCAAGAA
GGAAAAAGTT ATGAAAAACT TAGAGTTCTT ATGTCTGATC TTCTAGACTC TATGAGTTCT
CCCCTCAGCT CTACTTTTCT GTTCTTCAAT TTTTTACGAA AAGACTGGGG TCCTTGGAGT
CCATGGGGGA GATTTTTGCG CAAAAAGCAA GAACTCCATG AACTAATAAT TGCAGAAATT
CAAACTGCAA AGAAAGAAGG AAATCATCGT GATGATATTC TTAGTTTATT ACTAGAAGCC
CGTGATGAAG CAGGTAATGC TATGAGCGAC GAAGAAATTA AGGATGAACT ACTGACAATG
CTTTTCGCTG GTCACGAAAC TACGGCATCA GCTTTAGCAT GGGCATTATA TTGGATTGAT
ATGATCCCAT CAGTGGGTGA AAAACTCATG GCAGAATTAG CAACTATTCC TAGTAACTCG
GATCAAGTTG CTATTACTAA ACTTCCTTAC CTCAGCGCTA TTTGTCAAGA AACTCTTCGC
ATTTATCCTA TTGCTATGAA TGCTTTCCCT AGAGTTGTTC AGAAACCTAT AGAAATTATG
GGTTATCAAC TTGAACCGGG AATGGTGGCG ATAGTGCCTA TTTATCTGAC TCATCATCGG
GAGGATATTT ATCCAGAACC TAAAAAGTTT AAACCAGAAC GTTTTCTGGA AAGACAATTT
TCACCTTATG AATATTTACC ATTTGGAGGG GGTAGTCGTC GTTGTATAGG TTCAGCTTTT
GCTTTATTTG AAATGAAATT GGTATTGGCA ACAATTTTAT CACAGTGGGA ACTTAAGTTA
TTGCCTAACC AAAGAATTAG CCCTGTCCGG AGAGGGTTAA CTATGGCGCC ACCAGCAAAT
ATGCGGATGG TTGTGAAACC AAAAAAATCG TGGCAGAAAG TTAGCCAGCC TATTTTAACG
TCTGGTTGA
 
Protein sequence
MTLPDGPSLS PLQRRLRTWK FIFSPLSAIE ERYSEYGDIF RTNTNSLYPF IYFCNPKAIQ 
QIFTADPDTF TSGSINGILK YFVGLNSLLL QDGDRHKRQR KLLMPPFHGD RMRKYGDLIY
NITSNVISQW KIEQPFPIRK STQEISLKVI LAAVFGLDQE GKSYEKLRVL MSDLLDSMSS
PLSSTFLFFN FLRKDWGPWS PWGRFLRKKQ ELHELIIAEI QTAKKEGNHR DDILSLLLEA
RDEAGNAMSD EEIKDELLTM LFAGHETTAS ALAWALYWID MIPSVGEKLM AELATIPSNS
DQVAITKLPY LSAICQETLR IYPIAMNAFP RVVQKPIEIM GYQLEPGMVA IVPIYLTHHR
EDIYPEPKKF KPERFLERQF SPYEYLPFGG GSRRCIGSAF ALFEMKLVLA TILSQWELKL
LPNQRISPVR RGLTMAPPAN MRMVVKPKKS WQKVSQPILT SG
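
The relationship between the two sequences above can be sketched in Python. This is a minimal illustration, not the tool that produced the record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1, and it ignores table 11's alternative start codons because this gene begins with ATG.

```python
# Amino-acid assignments for codons ordered T, C, A, G at each position.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGACCCTAC CTGATGGCCC AAGTCTGTCA CCATTACAAC GAAGACTCCG GACATGGAAA"
print(translate(first_row))  # MTLPDGPSLSPLQRRLRTWK
```

Passing the full gene sequence to `translate` should give the protein sequence listed above without its block spacing, and `gc_fraction` on the full sequence can be compared with the reported GC content.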