Gene Saro_1883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1883 
Symbol 
ID3917104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1988558 
End bp1989931 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content69% 
IMG OID640444627 
ProductTPR repeat-containing protein 
Protein accessionYP_497157 
Protein GI87199900 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00380027 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCG CCGATGCGCA TCTGCGGCGG GCTGTCGCGC TGCGCGATTC CCAGGACCTC 
GCCGGGGCGC TCGATGCCAT ACGAGAGGCA GCGAGTGCCG CGCCAGGGGA CGCCAATGTC
GCTTTAGGTG TCGCTCAGGT CACATTCGAA GCCGGGCTCG ATGCGGCCGA TCTCTATGGG
CGGGCGGCAG AACTTGCCCC GGACCGCCTC GACCTCCAGC GCAGCCGCGC CAGCGCGCTG
GCGGCAGAGG GACGGCAGCC GGAGGCCGAG GAGCTGCTGG AAGCACTGCT GGCAAGGCAT
CCCGCATGGA TCGACGGCCA TCGCTGCCTG TGCGGCATGC GGGCGACTGC CGGCGAGGTC
GATTTCGCGC GGAGCTTTCG GAGCGCCGTG GCGCGCGAGC CCGAGAATTT CGGACTGCGC
ATGGCATGGT TCCACGTCCT TGCCACTGCT CGGCTTTGGG ACGAGGCAAG AGCGGTCGTC
GACGAGGCCG AGGCCCTGCT GGGAGAGCGG CAGGCGTCGC TTCTGGGCAA GCTGTTCATT
GCCAGCGAGA GCGGGGAAGA GGCTGCAAAC CCAAGCCTTT TCGACCGGGT GGAGCACGTG
CAGGACCTCG GGCTCGACAT CGCCCGGGTG CGGCATTTCC TGCGCGGCGG CCAGATCGAG
CGGGCGCGGG ATCTGTGCGT GCGACACATG GGCCAGCCCA CCATGCGCGC GTTCTGGCCC
TACGCCTCGC TCGCCTGGCG CCTCCTCGAC GATCCGCGCG CGCAGTGGCT CGACGCAGGC
ATGCGCCATG TGCGCGCGTT CGACCTGGAC TTCCGCGCGG AGGAGCTCGC CGCGCTGGCG
GAGACGCTGC GGCGGCTGCA TACGATGCGA CAGCCCTGGC ACGAACAATC CGTGCGCGGG
GGCACGCAGA CCGAGAGGCC GTTGCTGCTG CGGATCGATC CGGTGATCGC CAGTGCCAGG
GCGCGGATCG AGGCCGCGGT GCGGCGATAC ATCGATGAAT TGCCCGATCA CGATCCCGCC
CATCCATTGC TGGCGGCACC CCGCAAAGGC CTCCTCTTTT CGGGGAGCTG GTCGGTCAGG
CTGCGCCCCG GCGGCTTCCA TTCGGTGCAC ACCCATCCGA TGGGATGGCT CAGCTCGGCG
CTTTACGTGA CGGTGCCAGA GCGGGAACAG CGCGGCGCAG CGCCCGCCGG GCACCTGCGC
TTCGGTACGC CGCCGCCCGA ACTGGCCCTG CCGCTGGAGG CCTACGGTGA GGTAGTGCCT
GTGCCGGGGC GCCTGGCCCT TTTCCCTTCG ACCATGTGGC ACGGCACCGT CCCCTTCGCG
GATGGCGAAC GGATGACCAT AGCCTTCGAC ATCGTACCCA ACCTGAAAGC CTGA
 
Protein sequence
MSTADAHLRR AVALRDSQDL AGALDAIREA ASAAPGDANV ALGVAQVTFE AGLDAADLYG 
RAAELAPDRL DLQRSRASAL AAEGRQPEAE ELLEALLARH PAWIDGHRCL CGMRATAGEV
DFARSFRSAV AREPENFGLR MAWFHVLATA RLWDEARAVV DEAEALLGER QASLLGKLFI
ASESGEEAAN PSLFDRVEHV QDLGLDIARV RHFLRGGQIE RARDLCVRHM GQPTMRAFWP
YASLAWRLLD DPRAQWLDAG MRHVRAFDLD FRAEELAALA ETLRRLHTMR QPWHEQSVRG
GTQTERPLLL RIDPVIASAR ARIEAAVRRY IDELPDHDPA HPLLAAPRKG LLFSGSWSVR
LRPGGFHSVH THPMGWLSSA LYVTVPEREQ RGAAPAGHLR FGTPPPELAL PLEAYGEVVP
VPGRLALFPS TMWHGTVPFA DGERMTIAFD IVPNLKA