Gene Rcas_1621 details


Gene Information

Locus tag: Rcas_1621
Symbol:
ID: 5539097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2094693
End bp: 2096213
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 65%
IMG OID: 640893758
Product: MazG family protein
Protein accession: YP_001431731
Protein GI: 156741602
COG category: [R] General function prediction only
COG ID: [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain
TIGRFAM ID: [TIGR00444] MazG family protein
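
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; both are assumptions inferred from the numbers in this record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length(2094693, 2096213)
print(length)                  # 1521
print(protein_length(length))  # 506
```

Both results match the Gene Length and Protein Length fields of this record.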


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.909827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGC TTGTGGGGCT TGGTCCCGGC GATCCGGGCT TGATCACGCG CGCGGCGTGG 
GAGATGATTT CCGCAGCGCG CGTGCTCTAT CTGCGCACCG CTGTGCATCC GACGGTCGCT
GCGTTGCCGC CGTCGGTCGT CGTGCGCGCC TTTGACGATC TGTACGAACA GGCAGAGCGT
TTCGACGAGG TGTATGAGCG GATCGCCGAT GAACTGATCG CGCGCGCGCG CGGCGGTGAG
GCCGTGGTGT ATGCGACGCC GGGCGACCCG CTGACGGCGG AAGCCACATC GCGCCATCTG
CTGCGCCGCG CGCGCGCGCA GGGCGTTCCG GCGCGGGTTG TCCCCGGCGT CAGTTTTGTC
GAACCGGTCT GTGCGCTGTT GGGTGTCGAT CCACTCGAAC ATGGGTTGCA ATTGCTCGAT
GCGCTCGATC TGATGGTCGG CGACACAACG GTCGATGCGC CTTCGTGGGC GTCGCTCCAC
GGCTTTACGT ACACGCCGCC GCTCCTGCCG TTTCCGCTGA CGCCGACGCG CCCGGCGCTG
ATCTGTCAGG TCTACAGCCG GTCGGTCGCG TCACACGTCA AACTGTCGCT CCTGGAACGC
TACCCGGTTG ACCATCTGGT GACGCTGGTG CGCGCCGCCG GGGTCGTCGA TGCGGAGGCG
GCGGTTGAAT TGCCGCTCCA CACGCTCGAC CATCGCAATG ATTTCGATCA CCTGACGAGT
CTGTTTGTGC CGCCACTGAC GCCCCTTGCC GACCTGCGCG GACCGGACGG TGTGGCATAT
GTCGTCGCGC GGTTGCTTGG TCCGGGCGGG TGCCCGTGGG ATCGTGAGCA GACTCCGCAA
TCGTTGCGGG CATCGTTGCT CGAAGAGGTG CATGAGGCAT TGGAGGCGCT CGATGCCGGC
AACGACGAGG CGCTGGTCGA AGAACTGGGC GATGTGCTGA TCAATGTGCT GATGCTGAGC
GAAATGGCGC GTCAGGCAGA GCGCTTCGAC GCTGGCGAGG TGTTCAATGC CGTGGCTGGC
AAGTTGATCC GCCGCCATCC CCACGTCTTC GGCGAGCTGG ATGTCGCAGC GAGCGATCAG
GTCTTGCACA ACTGGGAAGC GATCAAGCGC GCCGAGCATG CCACAAAAGG GGTGTCACGC
CAGAGTGCGC TCGATGGCAT TCCGCCATCA TTGCCCGCGC TGGCAGCCGC GCAGAAGGTG
GTGTCGAAGG CCGCCAGAGC CGGGTTCGAT GCGCCGGAGA TTGACCACGC CTGGGATGCC
CTGGCGGAAG AACTTGCCGA ACTACGCGCC GTCACAACCG ATCCTGCGCA GGCGGAAGCA
GAATTGGGCG ATCTGCTCCT GGCGGTTGCC CGTCTGGGGT GGCGGCTCGA TGTGGATGCG
GAAAGTGCGT TGCGCGCAGC GGTTGCGCGT TTTCGGCGCC GCTTCGCGCG CCTTGAAACG
TTGCTCAACG GGCGCGATCT TCGCTCTCTG AGCATCGACG AAAAACTGGC ACTGTGGGAA
CGCGCGCGTG ACGATGGCTG A
 
Protein sequence
MITLVGLGPG DPGLITRAAW EMISAARVLY LRTAVHPTVA ALPPSVVVRA FDDLYEQAER 
FDEVYERIAD ELIARARGGE AVVYATPGDP LTAEATSRHL LRRARAQGVP ARVVPGVSFV
EPVCALLGVD PLEHGLQLLD ALDLMVGDTT VDAPSWASLH GFTYTPPLLP FPLTPTRPAL
ICQVYSRSVA SHVKLSLLER YPVDHLVTLV RAAGVVDAEA AVELPLHTLD HRNDFDHLTS
LFVPPLTPLA DLRGPDGVAY VVARLLGPGG CPWDREQTPQ SLRASLLEEV HEALEALDAG
NDEALVEELG DVLINVLMLS EMARQAERFD AGEVFNAVAG KLIRRHPHVF GELDVAASDQ
VLHNWEAIKR AEHATKGVSR QSALDGIPPS LPALAAAQKV VSKAARAGFD APEIDHAWDA
LAEELAELRA VTTDPAQAEA ELGDLLLAVA RLGWRLDVDA ESALRAAVAR FRRRFARLET
LLNGRDLRSL SIDEKLALWE RARDDG
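
The gene and protein sequences above are printed in space-separated blocks of ten, so they need light cleaning before use. The following is a minimal sketch of how one might strip the spacing, translate the gene, and compute GC content. It uses the standard codon assignments, which translation table 11 shares for elongation codons; it does not handle the alternative start codons that table 11 permits, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
print(translate("ATGATCACGC TTGTGGGGCT TGGTCCCGGC"))  # MITLVGLGPG
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence is a quick way to confirm that the two blocks were copied intact.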