Gene Rcas_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0251 
Symbol 
ID5537713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp308356 
End bp309870 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content62% 
IMG OID640892415 
Productpeptidase M1 membrane alanine aminopeptidase 
Protein accessionYP_001430402 
Protein GI156740273 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000018463 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.187628 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGAT CACCGAAAGC GCAAGCGTGG TCGGCTTCTC TGGCGCTGAT CGTGGCGATG 
CTCTGGCTCG GCGCCGCACC GCTCTCGGCA GCGCCGAGTG TCAGCGACCC GTTCACCGCG
CCGCTCGATG TCCGGCGGCA GGAAGCAGCG CTTCTGCCCG CCTTCAGTGC CGATCTGAAC
GCCGCCGAGC AATGGGATCG GTATACGGTG ATCGCTCGCG TTGATCCTGA AAAGCGCACA
ATTTCCGGCA GACTGCGCCT GGAATATGCC AATCGCGCTG CCGATCCGCT CGACCGCATC
TACTTTTATC TCTTTCCCAA TCTGCCGGAG TTTGGCGGAC GCCTCGACAT TCACAGCGCC
ACCGTTGACG ATGTGGCAGT CCGGGTACGC TATGAATCGA AAAGGTTTCT GTTGCGGATC
GACCTTCCTG CATCGCTTCC CTCTGGCGCT TCTACTGCTG TCGTCCTCGA TTTCAGCGCC
GCTGCGCCGC TCGATGCCGG TCAGCGCTAT TATGCCGCCT TCAACCGCGA ACGCGGCGTG
CTGGCGCTGG CATCGGCGCT GCCAATGGCG GCGCGGCACG TTGAGGGCGC CTGGCAACTG
GCGACTCCTC TCTTCCGCGG CGATCTTGTG ACCGGTGACA CGGCGCTGTA CGATGTGACG
CTCACCATCC CTGCCGCCTG GATTGCTGTG ACGACCGGGA CGGCAATCGA GAGCCAGAAT
GACGGCGCCG TCCAAACTAC ACGCTTCGTC AGCGGTCCGC AGCGTGATTT TACCATTGTG
CTCACCCGTT TCCCCTCGAT CTCCGCCGAG GTTGATGGCA CGCGCATCAC GTCGTATTTT
CGCCCCGAGA ACCCGGAAGG CGGGCGTGCC GCGCTCGATG CCGCCGTCAA CGCGCTGCGC
GTGTTCAATC GACGATTCGG ACCCTACCCG CTGACGGAAC TCGACATTGT TCAGATCGAT
GCGCGCAAAT TTCTCGGCGT CGAGTACCCT GGTCTGATCA TGATCGACCG CCGATTGTAC
GCTGGCGAGC GCGCAGGTCT GGAGATCATT GTGGCGCACG AAGTGGCGCA TCAGTGGTGG
TATAGCATGG TCGGCAACGA TGTGCAGAAC GAAGCATGGC TCGACGAGGG GTTGACCTCG
TTTACGCAGG TGGTCTATCA GGAAGAACTG CGTGGCGCCG CAGCAGCAGC GCGTGAGATC
GACGGATTCC GCGCGACCTA TCTGCGCGCG CGGCAGACGG GTCGTGATGC GCCGCTGAAG
CGCCCCGTGT CGGCGTTGCG CGGCAATTAT ACTGCTATTG CGTATGCGAA AGGGGCGCTT
TTCTTTCAGG CGTTGCGTGT GCGGATCGGT GAACCGGCGT TCGACCGCTT TTTGCGCGAT
TATTATGCCG CCTTTCGCTA CCGGATTGCG TCGAGCGACG ACGTGCGCGC TGTTGCCGAA
AACGCCTGTG CCTGCGACCT CAACGATTTC TATCGGGATT GGGTGCTGAC GGCTGCGCCG
GTTGCTGTGC CGTGA
 
Protein sequence
MQRSPKAQAW SASLALIVAM LWLGAAPLSA APSVSDPFTA PLDVRRQEAA LLPAFSADLN 
AAEQWDRYTV IARVDPEKRT ISGRLRLEYA NRAADPLDRI YFYLFPNLPE FGGRLDIHSA
TVDDVAVRVR YESKRFLLRI DLPASLPSGA STAVVLDFSA AAPLDAGQRY YAAFNRERGV
LALASALPMA ARHVEGAWQL ATPLFRGDLV TGDTALYDVT LTIPAAWIAV TTGTAIESQN
DGAVQTTRFV SGPQRDFTIV LTRFPSISAE VDGTRITSYF RPENPEGGRA ALDAAVNALR
VFNRRFGPYP LTELDIVQID ARKFLGVEYP GLIMIDRRLY AGERAGLEII VAHEVAHQWW
YSMVGNDVQN EAWLDEGLTS FTQVVYQEEL RGAAAAAREI DGFRATYLRA RQTGRDAPLK
RPVSALRGNY TAIAYAKGAL FFQALRVRIG EPAFDRFLRD YYAAFRYRIA SSDDVRAVAE
NACACDLNDF YRDWVLTAAP VAVP