Gene Rcas_4089 details


Gene Information

Locus tag: Rcas_4089
Symbol:
ID: 5541600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5299895
End bp: 5301088
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 62%
IMG OID: 640896201
Product: arginine biosynthesis bifunctional protein ArgJ
Protein accession: YP_001434139
Protein GI: 156744010
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase)
TIGRFAM ID: [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase
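
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5299895, 5301088)
print(length, protein_length_aa(length))  # prints "1194 397"
```

Both results agree with the Gene Length and Protein Length fields of the record.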


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.245941
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTACA ATATTATCGA AGATGGTCAT ATTTCGAGTC CGGCAGGATT TCGTGCCACC 
GGCGTTTCCT GCGGATTGAA GGAGATTCGC GCACGCGATC TGGCAATCGT CTACTCACAA
TTGCCCTGCC GTGTCGGGGC GCTCTTTACG ACGAACCTGA TCGTGGCTGC GCCGATTTTC
TTCAATCAGG CGATCCTGGC GCGGAACCGC GATGCCATCC GCGCTGTCGT CATCAATGCC
GGGCATGCCA ACGCCGGTAC CGGTCAACCG GGACTTGCGA CCGTCGTGGA GTGCGCCAAG
ATTGCAGCCG ATGAACTCGA AATACCGCGC GATAGCGTGT TGATGCTTTC AACCGGGCAG
ATCGGCGTTG CACCGCCGCT CGACCGTATG CGAGAAGGAA TCCGGCGTGC AGCGTCTGAA
CTGGACAGTA ATGGCGGGCG CCGCGCAGCG CTTGCGATCC TGACGAGCGA TACGCGCCCA
AAAGAACGCG CCTTCCGCGT GTCGCTGCGC GAAGGGCGAA CGGTCACGTT GGCCGGTATG
GCGAAAGGAA CGCGCATGGT CAGCCCGCAC CTTGCCACGC TGCTCTGCGT GATCACCACC
GACGCGCCGA TTGAGTCGCG CTTGTTGATG CGTGCGCTCG ACCAGAGTGT CAATCGTTCG
TTCGGAAGGT TGCACATCGA CGGCGATATG AGCCCAAACG ATGCGGTGCT GGTGCTGGCA
AACGGCGCGG CTGGAGGCCC GCCGATTATT GATGGATCGC GCGAACTCGG CGTCTGGCAA
CAGGCGCTCG ATGCGCTGTG CCACGATCTG GCGCAGCAGG TGTTGCGTGA TGCAGCGTCG
GGTGGGAAGC ATATCCTCAT TACGGTGCGT GGCGCATCCA ACGATGCCGC CGCCTCGCAG
GTGGCGCGCG CGGTTGCTCG GTCGACAGCC GTGCGCCATA TGTGCGCGCG CAATCTACCC
GATTGGGGCG GGATGCTCGT CGCCGTCGGC GCAAGCGGCG TGGACCTGCG CCCCGATATG
CTTGAACTGC GCATCGGCGC CGTCACGGTG ATGGATGATG GAGCGCCGGT GCGTTTCGAT
CCGACGGCGC TGGTGCAGGC GCTATCCGGT CCGGAAGTCG AGTTGGCGAT CGACCTGCAT
ACCGGCGCCG GTACGGCAAC CGTATGGACG TGTACGACCG GTATGGAGCC ATAA
 
Protein sequence
MSYNIIEDGH ISSPAGFRAT GVSCGLKEIR ARDLAIVYSQ LPCRVGALFT TNLIVAAPIF 
FNQAILARNR DAIRAVVINA GHANAGTGQP GLATVVECAK IAADELEIPR DSVLMLSTGQ
IGVAPPLDRM REGIRRAASE LDSNGGRRAA LAILTSDTRP KERAFRVSLR EGRTVTLAGM
AKGTRMVSPH LATLLCVITT DAPIESRLLM RALDQSVNRS FGRLHIDGDM SPNDAVLVLA
NGAAGGPPII DGSRELGVWQ QALDALCHDL AQQVLRDAAS GGKHILITVR GASNDAAASQ
VARAVARSTA VRHMCARNLP DWGGMLVAVG ASGVDLRPDM LELRIGAVTV MDDGAPVRFD
PTALVQALSG PEVELAIDLH TGAGTATVWT CTTGMEP
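
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any downstream use. The sketch below shows one way to do that and to recompute simple properties such as GC fraction, assuming the sequence text is pasted in as a plain string. The helper names are hypothetical. The stop codon set is the one defined for translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the sequence is a whole number of codons and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS_TABLE_11


# Last line of the gene sequence as listed on this page.
last_line = "ACCGGCGCCG GTACGGCAAC CGTATGGACG TGTACGACCG GTATGGAGCC ATAA"
tail = clean_sequence(last_line)
print(len(tail), tail[-3:])  # prints "54 TAA"
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field reported for the gene.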