Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_3447 |
Symbol | |
ID | 3935921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 3492199 |
End bp | 3495129 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637905821 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_511389 |
Protein GI | 89055938 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.54882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTTG ATCACAAGGG CATCATCGAC CGCTCCGCCC CCGTCCGTTT CCATTTCAAC GGCGCGCCCT ACAGCGGCTT CAAGGGGGAC ACCGTCGCCT CTGCCCTAAT GGCCAATGGG GTAAAGCTGG TGGGTCGGTC CTTCAAGTAT CATCGCCCGC GCGGGGTCCT GACCGCCGGA TCGGAGGAGC CGAACGCCCT GATACAGGTC GGCGAAGGCG ACGCGATGCT GCCCAATGTC CGCGCCACGG TGCAGGAGGT TTTCGCGGGG CTGGAGACGG CGTCACAAAA CCACCTCGGG CCGCTGGCCT ATGACCTGCT CAGCGTCAAC GATTTGTTCC CCAACTTCCT GAGCGCGGGC TTTTATTACA AGACATTCAT GTGGCCCCGC TCCTTCTGGG AACGTGTCTA CGAGCCCGCA ATCCGCCGCG CCGCCGGTCT GGGGCGGATG ACGGAAGGGG CATCGCCCGA CATCTCGGAG CGCGCCTTCG CCTTCTGCGA CGTGCTGGTG ATCGGCGGCG GCCCCACCGG CCTCATGGCC GCGCTCACCG CTGCAATGTC TGGCGCAGAC GTCATCCTGG TCGAGGAAAC CGCCGATCTG GGCGGGCGAG TGTTATCCGA CGGAGAGGTC ATCGACGGTC TGCCGGCGGA CGTCTGGGTG GGCGAAACCG TGCAAAAACT GCACGCCACG GGCCGCGTGC GGATCATGAC CCGCACCACG GCGACGGGCG TCTATGACGG TCTGACCTTT GGGGCCGTGG AACGCGTGGG CAGCCATCTG CCCCGCGCCG ATCATCTGCC GCAGGAATGC TTCTGGCGTA TCCGGGCCGG TCAGGCGGTG TTGGCCGCGG GCGCGCTGGA ACGGCCCATC GCCTTCCCCA ACAACGACCG ACCCGGCATC ATGATGGCCT CGGCCATCCG CACCTATATC AACCGCTATG GCGTGACACC CGGCGAGAAG GTCGCGATCT TCGCCAGCAA CGACGACGCC CACAAAACCG CGCTTGACCT GCAGGACGCG GGCGTTGACG TTGCCGCCGT CATCGACAGC CGCGAAGATG CACAAGCCAT GGGCGACTAT CGTCTGTATA CCGGCGCGCA GGTCGTGGGC AGCAAGGGAC GTCAGGCCCT GCGCGAGATC ACGGTTCAGC GCGGCAGTTC CACCTTCAAG GTGGAGGCCG ATTGTCTTGG TATCTCAGGC GGTTGGAACC CTACCCTGCA CCTCACCTGC CATTTGGGCG CACGGCCCGT ATGGGATTCG GGCATCCATG CTTTCGTGCC CAAAGAGGCC GCGATAGCGG GCTTGCGCCC CGCAGGGGCC TGCGCCGGGA CATTTTCATC CGAAGGATGC ATCAAAGATG GCATCGCCGC CGCGGCAGCG GCGCTGAAAG CGCTCGGGTT GCGCGTGCGA AAGCCCGATC TGCCGCAATC CGAGGGCGCC GCAGGCGCCA CCGCCCCCCT TTGGAGCGTC AGCGCAAAAG GCCGCGCCTG GCTCGACTTC GCCAATGACG TCACCACCAA GGACGTGAAA CAATCCGCCG CCGAGGGCTT CAAATCCGTC GAACACATGA AGCGCTACAC CACGCAGGGC ATGGCACCCG ATCAGGGCAA ATCCTCCAAC ATAGGCGCGC TTGCCGTCCT TGCTGACGCG ACCGGCAAGG GCATTCCAGA GACCGGCACC ACCACCTATC GCCCGCCCTT CACGCCCGTG GCGCTCGGCA CCCTGGCCGC CGGGGCGCAG GGCAAGGGCT TCGCGCCGGA ACGCTTCACC ACCTCGGACA AGGCCGCCCG CCAGCGTGGC GCGCCGATGA TCGCCGTGGG CCTCTGGTAT CGCGCCTCCT ACATGCCCCA GCCCGGCGAA ACCCATTGGC GGCAATCCTG CGACCGGGAG GTCAACATGG TCCGCAATGC CGTGGGCGTG GTCGACGTCT CCACCCTCGG CAAGATCGAG ATCTTCGGCG CGGATGCGGG CGCATTTCTC GATTTCTTAT ATACGAACAC CTTCTCCACC CTCAAACCGG GCCGCGCGCG CTATGGTCTG ATGCTGCGCG AAGACGGGCA TGTCATGGAT GACGGCACCA CCGCCTGCCT TGCCGACAAC CACTACGTGA TGACCACCAC CACCGCCGCC GCCGGTCCGG TGATGGCGCA TATGGACTTC GCGAGCCAGG TCCTGCGCCC GGATCTGGAT GTCGCCTTCA CGTCCGTGAC GGAACAATGG GCGCAGTTCT CCGTCGCCGG TCCCCACGCC CGCACCCTGA TCAACGGCGT GCTCGACCAG CCCATCGACG GTGACAGCTT TCCGTTCATG CAATGCGGCG CGGTCCGCGT TCACGGCGTC CCCGGTCGCC TCTTCCGCAT CTCCTTCTCG GGCGAACATG CCTACGAGGT CGCCGTGCCC GCGGCCTATG GCGACGCGCT CTACCGCGAC CTTGTGGCCC GCGCCGAGGC GTTGGGCGGC GGCGCCTACG GGATGGAGGC GCTCAACGTG CTGCGGATCG AGAAGGGGTT CATCACCCAT TCGGAGATCC ATGGCCGCGT CACCGCGTTC GATGTCGGCA TGCAGGGGAT GATGTCGAAA AAGAAAGACT TCATCGGCAA GGCGGCGGCG ACACGTCCGG GCCTGTTGGA GGCGGATCGC GAACGGCTCA TCGGCCTCAA ACCCACCGGG GCGGTGAAGG AGTTGACCGC AGGTGCGCAT CTGTTCAACA CCGATGACGC TCCAACGCGG GAAAACGACC AGGGCTACGT CACCTCCGTC GGCTATTCGC CCACCCTTGG CCATTTCGTC GGCCTTGGCT TCCTGCGTGA CGGGCCGAAC CGCGTCGGGC ATCATATGAT GATGGTCGAC CACCTGCGGG GCGTCACGGC GGCGGTCGAA ATCTGTGATC CGGTCTTCTT TGACCCCGAC GGAGGGCGCG CCCGTGGCTG A
|
Protein sequence | MRLDHKGIID RSAPVRFHFN GAPYSGFKGD TVASALMANG VKLVGRSFKY HRPRGVLTAG SEEPNALIQV GEGDAMLPNV RATVQEVFAG LETASQNHLG PLAYDLLSVN DLFPNFLSAG FYYKTFMWPR SFWERVYEPA IRRAAGLGRM TEGASPDISE RAFAFCDVLV IGGGPTGLMA ALTAAMSGAD VILVEETADL GGRVLSDGEV IDGLPADVWV GETVQKLHAT GRVRIMTRTT ATGVYDGLTF GAVERVGSHL PRADHLPQEC FWRIRAGQAV LAAGALERPI AFPNNDRPGI MMASAIRTYI NRYGVTPGEK VAIFASNDDA HKTALDLQDA GVDVAAVIDS REDAQAMGDY RLYTGAQVVG SKGRQALREI TVQRGSSTFK VEADCLGISG GWNPTLHLTC HLGARPVWDS GIHAFVPKEA AIAGLRPAGA CAGTFSSEGC IKDGIAAAAA ALKALGLRVR KPDLPQSEGA AGATAPLWSV SAKGRAWLDF ANDVTTKDVK QSAAEGFKSV EHMKRYTTQG MAPDQGKSSN IGALAVLADA TGKGIPETGT TTYRPPFTPV ALGTLAAGAQ GKGFAPERFT TSDKAARQRG APMIAVGLWY RASYMPQPGE THWRQSCDRE VNMVRNAVGV VDVSTLGKIE IFGADAGAFL DFLYTNTFST LKPGRARYGL MLREDGHVMD DGTTACLADN HYVMTTTTAA AGPVMAHMDF ASQVLRPDLD VAFTSVTEQW AQFSVAGPHA RTLINGVLDQ PIDGDSFPFM QCGAVRVHGV PGRLFRISFS GEHAYEVAVP AAYGDALYRD LVARAEALGG GAYGMEALNV LRIEKGFITH SEIHGRVTAF DVGMQGMMSK KKDFIGKAAA TRPGLLEADR ERLIGLKPTG AVKELTAGAH LFNTDDAPTR ENDQGYVTSV GYSPTLGHFV GLGFLRDGPN RVGHHMMMVD HLRGVTAAVE ICDPVFFDPD GGRARG
|
| |