Gene Rcas_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0414 
Symbol 
ID5537876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp522970 
End bp524505 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content59% 
IMG OID640892576 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001430563 
Protein GI156740434 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATTG GCGGATTGCT TTCGCGCCAT GCGCGCTATC GTCCCAATCA CACTGCGGTG 
ATCGTCGGTG ATGTGCGGCT GACCTATTGC GAGTTTAATG CGCGGGTCAA CAAAGTCGCG
CATGCGCTCC TGAACCTGGG GCTGGCCAAA GGCGACAAGA TTGCGACAGT GCTGCCGAAC
TGCATGGAGT TGCTCGAAGT CTACTGGGCA GCGGCAAAGA CCGGTCTGGT GGTGGTGCCG
ATGAGCACCC TGCTGCGCGG TCAGGGGCTG GCATCACTCT TGCGCGACTC GGATAGTGCC
GCGGTGGTAA CCGATGCAAC GCATGCGCCG GCGCTCGACT CGGTTCGCGG CGATCTGCCG
ATTGCGCAGG ATCGGTTCCT GATCATTGAT GCTCCCGATC AACCGGGATA CCGCGATTAT
CAGGCACTCG TTGCGCCGAT GCCAGAGCAC GATCCAACCG GGATTGATCT CCACGCCAAT
GATCCATACA ACATTATGTA CAGCAGCGGC ACGACCGGTT TGCCGAAAGG CATCGTGCTC
ACGCACGGGG TGCGCGCCGG GTACGGGACA ATTTTTGCGT CGTCGTACCG CATCGTGCCG
GAGAGCGTTA TTCTGCATGC GGGCGCGCTC GTGTTCAACG GCGCCTTCCT CACGCTCATG
CCAGCATTCT ACCTGGGAAC AACCTACATC CTGATGAAGG CGTTCAATGC CCGCGAGTTG
ATCGAGACCG CAGCACGCGA GAAGGTGACA CATATTAAGA TGGTGCCGTC GCAAATCGTG
GCGCTGCTCA ACGAACCCGA CTTCGATGAG CAGCATCTGC CGTCAATCGA AATGCTTGGC
TCAGTCGGCG CACCGTTGCA CATGGAGCAC AAACTCGAAC TCGAACGGCG CTTTCCTAAC
CGCCTCTACG AACTGTACGG ACTGACCGAA GGGTTCATGA CTATTCTCGA CAAATACCAC
CGGGGAGAGA AACTCGCGTC GGTCGGCGTG CCGCCACCGT TCATGGAGAT CAAGATCATC
GACGACCAGC AGCGTGAGTT GCCGCCGGGT GAAGTTGGCG AAATCTGCGG GCGCGGTCCG
TTGATGATGA GCGGTTACTA CAAGCGTCCC GATCTGACAG CACAGGCGAT CATCGATGGC
TGGCTGCACA GCGGCGATAT GGGGTATGTT GATGAGGATG GATTCCTCTA CCTGGTGGAC
CGTAAGAAAG ACATGATTAT TTCTGGTGGC ATCAACGTCT TTCCGCGCGA CATCGAGGAA
ATCATCGTGC AGCATCCGGC AGTGCGCGAG GCGGCAGTCT TCGGCGTGCC AAGCGAGAAG
TGGGGCGAAA CGCCGCTGGC AGCGGTCATC CTGAAGGCGC CGGGGCTGGT AGCGGCAGAG
GAGTTGAAGG AATGGATCAA TGCGCGCGTC GAAGCCGGGT ATCAGAAGGT GTCGAAGGTC
GTGATCATGG ACGATTTTCC GCGCAGCGCA GCGGGCAAAA CGCTCAAGCG CGTCATGCGC
GACGAGTACT GGAAGGGGCG TGAGAGTAAG ATTTAG
 
Protein sequence
MHIGGLLSRH ARYRPNHTAV IVGDVRLTYC EFNARVNKVA HALLNLGLAK GDKIATVLPN 
CMELLEVYWA AAKTGLVVVP MSTLLRGQGL ASLLRDSDSA AVVTDATHAP ALDSVRGDLP
IAQDRFLIID APDQPGYRDY QALVAPMPEH DPTGIDLHAN DPYNIMYSSG TTGLPKGIVL
THGVRAGYGT IFASSYRIVP ESVILHAGAL VFNGAFLTLM PAFYLGTTYI LMKAFNAREL
IETAAREKVT HIKMVPSQIV ALLNEPDFDE QHLPSIEMLG SVGAPLHMEH KLELERRFPN
RLYELYGLTE GFMTILDKYH RGEKLASVGV PPPFMEIKII DDQQRELPPG EVGEICGRGP
LMMSGYYKRP DLTAQAIIDG WLHSGDMGYV DEDGFLYLVD RKKDMIISGG INVFPRDIEE
IIVQHPAVRE AAVFGVPSEK WGETPLAAVI LKAPGLVAAE ELKEWINARV EAGYQKVSKV
VIMDDFPRSA AGKTLKRVMR DEYWKGRESK I