Gene Rcas_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0200 
Symbol 
ID5537661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp243637 
End bp244752 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content62% 
IMG OID640892363 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001430351 
Protein GI156740222 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCAT CGATCACCGG GTTGCTGCGC CCCGACATTG CGGCGCTCGA ACCATACACG 
CCGATTGTTC CGCTTGAAAC GCTCGCCGAG CGACTCGGTC TGCCGGTCGA ACGCATCATC
AAACTCGACG CCAACGAAAA CCCCTACGGT CCATCACCGC GCGCGCTGGC GGCGCTTGCC
GCCGTCGAAC ACGATGCCCC TCATCGCTAC GCCATCTACC CCGACCCGGA TCATACCCGC
CTGCGCGCCG CCCTCAGCCG GTACGTCGGT CAACCGCCAG AACGTATTAT CTGCGGCGCG
GGGTCCGATG AACTGATCGA CCTGATCATG CGTGCCGTCC TGCGTCCTGG CGATGTCATG
GTCGATTGCC CGCCGACCTT TGCCATGTAC AGTTTCGATG CGGCGCTCTA CGGCGCGCGT
ATCGTTGCGG TTCCGCGCAC CGAACAGTTC GATGTCGATG TCGAGGGAGT TGCGGAAGCG
GTCGAGCGTG ATGGCGCAAA ACTGCTGTTC CTGGCGGCGC CGAACAACCC GACTGGAACG
CCGCTGGCGC GCACTACGGT CGAGCGTTTG CTCGATCTGC CGATCATCCT GGCGATTGAT
GAAGCCTATG CCGAATTTGC CGGGACGAGC GTTATCGATC TGGTTGGCAC GCGCCCCAAT
CTGGTCGTCC TGCGCACCTT CAGCAAATGG GCGGGGCTTG CGGGGCTGCG CATCGGTTAT
GCGGCAATGC ACGAAGACGT GATTACGTAC CTGTGGAAGA TTAAGCAACC GTACAATGTC
AATGTCGCCG CCGAAGTCGC CGCAGTTGCG TCACTCGACG ATCTGGACGA GCGGCTGTCC
ACTGTCGCGC GTATTGTCGC CGAGCGCGAA CGCCTGGCGG CTGCGTTGGC GGCGCTGCCT
GGCATTCACG TCTACCCCAG TGCGGCGAAC TTCCTGCTCT GTCGGATGAC CAGTGGTGGC
GCTGCGCGCG CCCGCGCCAT CCGCGACACC CTGGCGCAGC GTGGGATTCT GATCCGCTAC
TTCAACCGAC CAGGGATCGA CGATTGCATT CGTATCAGCG TCGGACGCCC GGAGCAAAAC
GACGCCCTAT TGGATGTGCT AAAGGAAGTA GCATAG
 
Protein sequence
MPASITGLLR PDIAALEPYT PIVPLETLAE RLGLPVERII KLDANENPYG PSPRALAALA 
AVEHDAPHRY AIYPDPDHTR LRAALSRYVG QPPERIICGA GSDELIDLIM RAVLRPGDVM
VDCPPTFAMY SFDAALYGAR IVAVPRTEQF DVDVEGVAEA VERDGAKLLF LAAPNNPTGT
PLARTTVERL LDLPIILAID EAYAEFAGTS VIDLVGTRPN LVVLRTFSKW AGLAGLRIGY
AAMHEDVITY LWKIKQPYNV NVAAEVAAVA SLDDLDERLS TVARIVAERE RLAAALAALP
GIHVYPSAAN FLLCRMTSGG AARARAIRDT LAQRGILIRY FNRPGIDDCI RISVGRPEQN
DALLDVLKEV A