Gene Rcas_3969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3969 
Symbol 
ID5541475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5175011 
End bp5176030 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content59% 
IMG OID640896077 
Producthexapaptide repeat-containing transferase 
Protein accessionYP_001434020 
Protein GI156743891 
COG category[R] General function prediction only 
COG ID[COG0110] Acetyltransferase (isoleucine patch superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.520917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0660336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGC AGTCTTCCTC AGCGCATGGA AGCATTCTTA TCCCTATGCA CACGCTGTTT 
GTCGTGTTAA TCCTGCTAGC GCCGCCGCCG CTTAAGCCCT GGTTGATGCG GACGCTGCTC
GGCGCGCGTG TCGGACGGAA TGTGCGCGTC GGTTGGTTCG CTGGCATATC GGCGCGCCAC
ATTGCCATTG GTGACGAAAG TGATATTCGG GCGTTGACCT TCATCAGTTG CCACGGCGAT
GTGATCATCG GTCGCTACTC GATTATCAGC AGTTTCGTCC TGGTGTATGG CGCCGCAGAC
CTGATCATTG GCGACCATGC GTATATTGGT CCGCAAACGT TCATCAATTG TGATGAATGT
GTTCGCATTG GCAACTATTC CGCACTCGGC GCGCGCTGCA TGGTCTACAC GCACGGGTCA
TTCTTCCCGT ACACCGAGGG CTACTGGGTG AAGTTCGGAC CGGTCACGAT CGGCGACTAT
GTCTGGTGCG CGGCCGGTGT CTTCATTCAT CCAGGAGTGA CAATCGGCGA CCATGTGTTT
ATCAATTCGC GTTCGGTCAT TACCCGCGAT GTTGCCTCCG GCGATGTGGT CGAGGGCTTT
CCGGCGCAAA CGGTCACGAC CATGAACCGC CTGAAACGCA GCATGTCGCC GCGCCGCCGT
GATGCGGCTG CGCGTCGGAT TCTCGATCAC TTCGTCGATC TCGGCGTGCG GCGTGAACTG
CGCCTCGCCG TCGAGCAGCG CGATGGGCAG GTCGCCTTTC GGTATCGTGG GAGGAAGTAC
CGACTGCTGT GCATCCCTTC AGACGGCGCC CCGCCATCGT TCGATAACGG ACCCGCATGT
CACATCGTTG CACTGGTGAC TCGTCCTGAT TGGACGCCGC CAACCGGCGC GCCGATCTAT
CCGCTCGACC TGATTGCCTA CCGCACTCCG CGCAGCAACG ATCCCGTCCA TCATGCGTTG
CGCACCTTCC TGATGCGCTA CTACGGCGTG CAGGTCGAAT ACAGCGATGC CGCGAAGTAA
 
Protein sequence
MIAQSSSAHG SILIPMHTLF VVLILLAPPP LKPWLMRTLL GARVGRNVRV GWFAGISARH 
IAIGDESDIR ALTFISCHGD VIIGRYSIIS SFVLVYGAAD LIIGDHAYIG PQTFINCDEC
VRIGNYSALG ARCMVYTHGS FFPYTEGYWV KFGPVTIGDY VWCAAGVFIH PGVTIGDHVF
INSRSVITRD VASGDVVEGF PAQTVTTMNR LKRSMSPRRR DAAARRILDH FVDLGVRREL
RLAVEQRDGQ VAFRYRGRKY RLLCIPSDGA PPSFDNGPAC HIVALVTRPD WTPPTGAPIY
PLDLIAYRTP RSNDPVHHAL RTFLMRYYGV QVEYSDAAK