Gene Rcas_2712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2712 
Symbol 
ID5540198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3500753 
End bp3501853 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content59% 
IMG OID640894838 
Productpeptidase M29 aminopeptidase II 
Protein accessionYP_001432801 
Protein GI156742672 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATC CCTATGTCGA AAAAATGGCG CAAGTGCTGG TGCGCTATTC GCTGGCACTC 
AAGCCAGGCG ATCTGTTTCG CATTATTGCG CCGCCTGCCG CCGCGCCACT GGTGCGCGCG
TTGTACAAAG AAGCGTTGCT GGCGGGCGCC AACCCGTATC TGGCGATGAC CCTTGAAGAG
ACCGATGAAC TGCTTCTGCG CCACGGCTCC GATGCGCAGA TCGGGTACGT CTCACCGATG
ATGCGCCAGG AGATTGAAGA GATCAACGCA ACCATTCGCA TTATGGCGAG TGAGAATACC
CGTGCGCTGT CCGGGATCGA TCCGCGCAAG GTTGCACTGC GTCGCCAGGC ATTGAGTGAC
CTCCAGAAGC GTGTGATGCA GCGCTCCGCC GAAGGAACGC TCAACTGGTG CGTGACGCTC
TTCCCGACCA ACGCCGCCGC ACAGGACGCC GATATGTCGC TGAGCGATTT TGAGGAGTTT
GTCTACCGCG CCTGCAAACT CCACTATAAC GATCCGGTCG CTGAATGGCG TAGAACGCTG
GAGGAACAAC AGCAGATCGC CGATTTCTTA AACACGTGCA GCGTCATTCG CCTGGTTGCG
CCGGATACCG ACCTGACCTA CCGGGTCGCC GGGCGCACCT GGATCAACTG CGCCGGCGAT
CGGAATTTCC CCGATGGTGA GGTCTTCTCC AGTTGCGATG AAACGGCAAC CGAGGGGTAC
ATTCGCTACA CCTTCCCGGC GATCTATGCT GGACGCGAAG TCGAAGACAT CCGTTTGTGG
TTCGAGGATG GTAAGGTTGT GAAGGCGACA GCCGCCAAGG GCGAAGACCT GCTGCACTCG
CTGCTCGCTA TGGACGAAGG AGCACGGCGG TTGGGCGAGG TTGCGTTCGG GACAAACTAC
GACATCACGC GCTTCAGTCG CAATATTCTG TTCGACGAAA AGATCGGCGG CACGGTGCAC
CTGGCGCTCG GCGCCGGCTA CCCGGAGACC GGTTCGCGCA ATACATCGGC GCTCCACTGG
GATATGATCT GCGATATGCG TCAGGGTGAA GCATACGCCG ACGGACGATT GATCTACAAA
GAAGGGAAGT TTCTGATTTG A
 
Protein sequence
MADPYVEKMA QVLVRYSLAL KPGDLFRIIA PPAAAPLVRA LYKEALLAGA NPYLAMTLEE 
TDELLLRHGS DAQIGYVSPM MRQEIEEINA TIRIMASENT RALSGIDPRK VALRRQALSD
LQKRVMQRSA EGTLNWCVTL FPTNAAAQDA DMSLSDFEEF VYRACKLHYN DPVAEWRRTL
EEQQQIADFL NTCSVIRLVA PDTDLTYRVA GRTWINCAGD RNFPDGEVFS SCDETATEGY
IRYTFPAIYA GREVEDIRLW FEDGKVVKAT AAKGEDLLHS LLAMDEGARR LGEVAFGTNY
DITRFSRNIL FDEKIGGTVH LALGAGYPET GSRNTSALHW DMICDMRQGE AYADGRLIYK
EGKFLI