Gene Rcas_3355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3355 
Symbol 
ID5540854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4380902 
End bp4382125 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content63% 
IMG OID640895473 
Productarginine biosynthesis bifunctional protein ArgJ 
Protein accessionYP_001433423 
Protein GI156743294 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCAG ACACTTTCTC ACTCGCATCC GGTTTTGCTG CGACGGCGAC CGCCTGTGGA 
CTGAAGCCGA ACGGCGCGCT CGACATGGCG CTGATCGCAA CGGACGCTCC GTGCAGCGCC
GCCGGTCTGT TCACTACTAA TCGGATCAAA GCGGCGCCGG TGATCTATGA TCAGGATGTG
CTGGCAGCGA ATGCCGGTGC GATCCGCGCG GTTGTGGTCA ATGCAGGCAA CGCCAATGCC
TGCACCGGAC CGCAAGGCGA CGCCAACTGC CGCGCAATGG CGGCGATGAC TGCCGAACGC
CTCGAATGCC GCGCCGATCA GGTGTTGGTC CTTTCGACCG GCGTGATCGG CAGGCAACTC
GATATGACAA AGGTTGCTCA GGGAGTCGCC AGCCTGACCG GACCGACCGC GCACCGGGGT
GCAGGCGCGG CTGCGCGCGC CATCATGACT ACCGACACCC GCCCGAAAGT CGCTGCGCGC
ACGACCTCAG TTGCCGGAAA GCCTATCACC ATTGCCGGAA TGTGCAAAGG CGCCGGGATG
ATCCATCCCA ATATGGCGAC CATGCTGGCG ATTGTGACGA CCGATGCGCA GGCGTCACCC
GCAACCCTCG ATCGTGCCCT GCGCTATGCC GCCAATCGTA GTTTCAACCG CGTCAGCGTC
GATGGTGATA CCAGCACAAA TGACACCCTC CTCCTGCTGG CATCCGGCGC GTCAGGCGTT
CGGGTGAGCG ATACGCCGGA AGCGGACGAT TGTTCGTTTG ACCAATTCAC GGTGTTGCTC
ACCGAAGTGT GCATCGATCT GGCGAAACAG ATCGCGCGCG ATGGCGAAGG TGCCACGCGC
CTGGTGGAAA TTGTGGTCAG CGGCGCACAG GACGAGCAGC AGGCGCACCA GGTGGCGAAC
GCCATCGCGC GCTCGCCGCT GGTGAAAACC GCCATCCATG GCGGCGATCC GAACTGGGGG
CGCATTGTGT GCGCCGCTGG CTACAGCGGC GCTGCCATCG CCCCCGACCG ACTGGCGCTC
TGGTTTGGTC CGGCGGACTC ACGGGTTCAG TTAGTTGCCA ATGGGCTGCC ACTCGATGCC
GACCTGGCAG CGGCTTCGGC GCTCCTGCGC CAGGACCCGG TCTTCATCAC GCTCGACCTT
GGGCTGGGTA GCGCGCATAC GACCGTCTGG ACGTGCGATT TCAGCAAGGA GTATGTTGAA
ATCAATGCGC ACTATACCAC CTGA
 
Protein sequence
MQADTFSLAS GFAATATACG LKPNGALDMA LIATDAPCSA AGLFTTNRIK AAPVIYDQDV 
LAANAGAIRA VVVNAGNANA CTGPQGDANC RAMAAMTAER LECRADQVLV LSTGVIGRQL
DMTKVAQGVA SLTGPTAHRG AGAAARAIMT TDTRPKVAAR TTSVAGKPIT IAGMCKGAGM
IHPNMATMLA IVTTDAQASP ATLDRALRYA ANRSFNRVSV DGDTSTNDTL LLLASGASGV
RVSDTPEADD CSFDQFTVLL TEVCIDLAKQ IARDGEGATR LVEIVVSGAQ DEQQAHQVAN
AIARSPLVKT AIHGGDPNWG RIVCAAGYSG AAIAPDRLAL WFGPADSRVQ LVANGLPLDA
DLAAASALLR QDPVFITLDL GLGSAHTTVW TCDFSKEYVE INAHYTT