Gene Rcas_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3335 
Symbol 
ID5540833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4351226 
End bp4352764 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content58% 
IMG OID640895452 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001433403 
Protein GI156743274 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.277543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAATC TCGCCATTAT TCTGGAGGAG AGCGCGCGCC GGATGCCCGG AAAGACTGCT 
GTCATCCTCG ACAGCATTCG CCTGAACTAT GCCGAACTAA ACGGCGCCGC CAACAAAATT
GCCAACGGTC TGGCAAATCT CGGCGTGCGC CCTGGCGACA AAGTGGCGAT GATGATCCCC
AACACGCCAC ACTTCCCGAT GTGCTATTTC GGCATTCTCA AGGCTGGCGC GACGGTTGTG
CCGCTCAATG TGCTATTCAA GCGCGATGAG GTTCGGTATC ATCTGGAAGA CAGCGACTCG
GTCGCACTGA TCGTCTGGGA AGGATTCCTC GATGAAGCCG CGTCCGGCTT TCACGCGGTC
AAAACCTGCC GTCACCTGAT CGTCGCGCAG GCGCCGGGTT CCACGGCGAC ACTCCCCGAT
GGCGCGATTC CGCTCGGCAG TCTCCTCGCC GAACACGCTC CGGTATTCGA TACGGTCCAG
ACCATGCCGG ACGATACGGC GGTTATTCTC TACACCAGCG GCACGACGGG TCGCCCCAAG
GGCGCGGAAC TGACCCATGC CAACATGTTC TTGAATGCCA CGATCTGCAC CGATAAGTTG
CTGAATGTCT CGTCAGAAAC GGTGGGGCTG GCGGTACTGC CATTGTTTCA CAGTTTTGGA
CAGACGTGTG TGATGAACAG TCTGATCTAC GCCGGCGGCG CCATCACCAT GCTGCCGCGC
TTCGAACCGC AGAAGGCGCT CGAAGTGATG GCGCGCGACC GGGTGACGTA TTTCGCCGGC
GTGCCGACGA TGTATTTCTA TCTGCTCAAT TTCCCCGGCG CAGATCAGTA TGATCTGTCG
GCGCTTAAGT TCTGCGTCTC AGGCGGCGCA GCGATGCCGG TCGAAGTGAT GCATGCCTTC
AACCGTAAAT ACAATGTCAC CATCCTCGAA GGGTACGGTC TCTCCGAAAC TTCTCCGGTG
GCGTCGTTCA ATCATCTCGA CCGCGAGCCA AAGCCGGGGT CGATCGGCGT GCCGATCTGG
GGCATTGAGA TGCGCGTGGT GGATGACCAG GGGCGCGAGG TTCCCAACGG CGAACTTGGT
GAGATTGTCA TTCGTGGGCA CAATGTGATG AAGGGGTACT ACAAGCGTCC CGACGCAACT
GCTGATGCGA TTCGCAATGG CTGGTTCCAC AGCGGCGACA TAGCCTATCG TGACGATGAC
GGTTTCTTCT TCATCAAGGA TCGCGTGAAG GATATGATCA TCCGCGGCGG GTTCAATGTC
TATCCGCGCG AGATCGAAGA GGTGCTCTAC GGGCATCCAG CCATCGCCGA AGCGGCCGTC
ATTGGCGTTC CCGATCAGGC GCTTGGCGAG GAGGTCAAGG CGGTTGTCGC CCTGAAGCCA
GGGCATACGG CAACCGAGAC AGAGATTATT GAGTACTGTA AGGAACGCCT GGCTGCCTAC
AAGTATCCGC GCATCGTCGA AATCCGCGAG ACGTTGCCCA AGACGGCGAC CGGCAAAATC
CTCAAGCGCG AATTGCGTCA GATCGAAGTG ACTGCATAG
 
Protein sequence
MLNLAIILEE SARRMPGKTA VILDSIRLNY AELNGAANKI ANGLANLGVR PGDKVAMMIP 
NTPHFPMCYF GILKAGATVV PLNVLFKRDE VRYHLEDSDS VALIVWEGFL DEAASGFHAV
KTCRHLIVAQ APGSTATLPD GAIPLGSLLA EHAPVFDTVQ TMPDDTAVIL YTSGTTGRPK
GAELTHANMF LNATICTDKL LNVSSETVGL AVLPLFHSFG QTCVMNSLIY AGGAITMLPR
FEPQKALEVM ARDRVTYFAG VPTMYFYLLN FPGADQYDLS ALKFCVSGGA AMPVEVMHAF
NRKYNVTILE GYGLSETSPV ASFNHLDREP KPGSIGVPIW GIEMRVVDDQ GREVPNGELG
EIVIRGHNVM KGYYKRPDAT ADAIRNGWFH SGDIAYRDDD GFFFIKDRVK DMIIRGGFNV
YPREIEEVLY GHPAIAEAAV IGVPDQALGE EVKAVVALKP GHTATETEII EYCKERLAAY
KYPRIVEIRE TLPKTATGKI LKRELRQIEV TA