Gene Rcas_3075 details

Gene Information

Locus tag: Rcas_3075
Symbol:
ID: 5540571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3981384
End bp: 3982751
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 63%
IMG OID: 640895194
Product: argininosuccinate lyase
Protein accession: YP_001433147
Protein GI: 156743018
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
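
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Rcas_3075.
length = gene_length_bp(3981384, 3982751)
print(length, protein_length_aa(length))  # prints "1368 455"
```

Both results agree with the Gene Length and Protein Length fields of the record.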


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGGGCG GTCGTTTCGA CGAAGGCATT GACGCGCGCA TGGCGCGGTT CAACAACTCG 
TTCCCGTTCG ATCAGCGCAT GTGGCGCGAG GATATTCGCG GAAGCATGGC GTGGGCGCGT
CAACTCGCAC AGGCAGGAGT CATATCGACG GAAGAGCGCG ACACACTGCT GACGGGTCTT
GAGACGGTCT TTGCTGAGTT TGCCAATGAT CGATTCGAGG CGCGACCAAC TGACGAAGAC
ATTCACACTG CCATCGAGCG CCGTCTGGGA GAACTCGTTG GCGCAGTCGC CGGAAAACTT
CACACCGGGC GCAGCCGCAA TGATCAGGTG GCGACCGATG TGCGACTCTG GACGATGGGT
GCAATCCAGC GCATCGATGA CGGCGTGCGG GCGCTGCAAC AGGCGCTGCT GACGCAGGCA
GAAGCCGCCG GCGACGCGCT GATGCCCGGC TATACGCATC TGCAACGCGC CCAACCGGTG
TTGCTGGCGC ACTGGCTGCT CTCACACTTC TGGTCTGCGC AACGCGACCG TGAACGCCTG
ACGGATTGCG CAAAACGAAC GTCAGTGCTG CCGCTCGGCT CAGGCGCCAT CGCCGGCACG
CCACTGGCAA TCGATCGCGC AGCGCTCGCC GCCGATCTGG GAATGGCAGC GATTTCTCCA
AACAGCATCG ACGCTGTCAG CGATCGTGAT TTCGTTGCCG AATTTCTGTT CTGCGCGGCG
CTGATCGGCA TACATCTCAG CCGTCTGGCG GAAGACATGA TCATTTACAG CAGCGCCGAG
TTCGGTTTCG TCGTTCTCGC CGACGCCTAC AGCACTGGAT CGAGTCTGAT GCCGCAGAAG
AAAAACCCTG ATTCGTTCGA ACTGCTCCGC GGCAAAGCCG GGCGTCTCAC CGGCGACCTG
GTCACGGTGC TGACCGTGCT GAAAGGGATA CCGTCCGCCT ACGACAAAGA CTTGCAGGAA
GACAAAGAGC CGCTGTTCGA CGCCGCCGAC ACCCTCGAAC TGGCGCTGCC GGTCGCTGCC
GGGGCAGTCG CAACGGCTCG CTTCCGTCAC GACCGCATGC GCGCGGCGCT CGATGATGCG
ATGCTGGCTA CCGATGCCGC CGATTACCTG GTGGCGCGCG GCGTACCATT CCGCGAAGCG
CACCATGTGG TTGGCAGGCT GGTGCGTGAG GCGGAGCAAC GTGGGGTTGC GCTCTCGGCG
CTGCCGCTCG ATATACTCCT GGCGGCGCAT CCGGCCTGCG GCAGCGATAT TCTTCAGGTG
TTCGACATGG ATCGCTCTGC GGCGCAGCGT CGCGTTCCGG GCGCAACCGC GCCGGGATCC
GTGCGCGAAC AGATCATCCG GGCGCGGCAG TGTCTTGGGG AACATTGA
 
Protein sequence
MWGGRFDEGI DARMARFNNS FPFDQRMWRE DIRGSMAWAR QLAQAGVIST EERDTLLTGL 
ETVFAEFAND RFEARPTDED IHTAIERRLG ELVGAVAGKL HTGRSRNDQV ATDVRLWTMG
AIQRIDDGVR ALQQALLTQA EAAGDALMPG YTHLQRAQPV LLAHWLLSHF WSAQRDRERL
TDCAKRTSVL PLGSGAIAGT PLAIDRAALA ADLGMAAISP NSIDAVSDRD FVAEFLFCAA
LIGIHLSRLA EDMIIYSSAE FGFVVLADAY STGSSLMPQK KNPDSFELLR GKAGRLTGDL
VTVLTVLKGI PSAYDKDLQE DKEPLFDAAD TLELALPVAA GAVATARFRH DRMRAALDDA
MLATDAADYL VARGVPFREA HHVVGRLVRE AEQRGVALSA LPLDILLAAH PACGSDILQV
FDMDRSAAQR RVPGATAPGS VREQIIRARQ CLGEH
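
The gene and protein sequences can be spot-checked against each other by translating the DNA codon by codon. The sketch below is illustrative only: it builds a codon table by hand rather than using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (it differs only in the permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last sequence blocks of the record are translated here.

```python
# Codon table in the conventional T, C, A, G ordering ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, copied from the record.
first_block = "ATGTGGGGCG GTCGTTTCGA CGAAGGCATT GACGCGCGCA TGGCGCGGTT CAACAACTCG"
last_block = "GTGCGCGAAC AGATCATCCG GGCGCGGCAG TGTCTTGGGG AACATTGA"

print(translate(first_block))  # prints "MWGGRFDEGIDARMARFNNS"
print(translate(last_block))   # prints "VREQIIRARQCLGEH*"
```

The two outputs match the first 20 and last 15 residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.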