Gene Information |
Locus tag | Jann_4168 |
Symbol | |
ID | 3936657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 4276418 |
End bp | 4278658 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637906554 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_512110 |
Protein GI | 89056659 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
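The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # stop codon is not translated
    return gene_length, protein_length


# Values copied from the Start bp / End bp rows of the record.
print(cds_lengths(4276418, 4278658))  # (2241, 746)
```

Under these assumptions the result agrees with the Gene Length (2241 bp) and Protein Length (746 aa) rows.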
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGC TGAAAACGTT CACCCGCCGC GCGTTCCTCG TGGGCTCAGC CGCCGTCGCG GGCGGCGTGG CGTTCGGGAC CTATCTGGTG GCCCGCGACC CGGACAACCC CAATCTCCAG GGCTTGGCGG AGGGTGCGGC CTCCTTCAAC CCATGGGTCC GCATTGACGC GACGGGCATC ACGCTGGTCA CGCCCCACGC CGATATCGGA CAGGGTGCGG CCTCTATGCA GGCCGCTTTG ATCGCGGAAG AGATGGATCT GGACTGGGCC GACTTCGAGA TCGACTTTGG ACGTCCTGCC CCGGCCTATT GGAACACGGC GATGGCGGGC GAAAGTGCGC CCGTGCTGCC GTGGGATGAC AGTTTTGGCG CACAGGCCAT GCGCTCCACC GTCGGGTCCT TGTTGAAGGT AATCGGGATG CAAGGGACCG GCGGCTCCAC CTCCACGCCC GACAGCTACG TGAAGCTGCG CGAAGCCGGG GCCATGGCGC GGGAAACGCT GAAGCTGGCG GCATCGCAGC GCACGGGTAT TTCTTTTACT GAGATGCGGA CAGAGGGGGC GGCCGTCCAT CTGCCCGACG GCACGGCAAT TCCTTACGTG GACCTGGCGA CTGAGGCCGC GGGGTTGGAG CCTGTCCAAG GCACGCCGCT CCGCGATCCG TCCACGTGGC GGTTGGTGGG CAAGGATATG CCACGCATTG ACATTGTCCC GAAATCGACC GGCGCGCCGA TCTACGGGCT CGACATTGAG CTGCCCGATA TGGCCCGCGC AAGCCTGCGA ATGAACCCGC GCAAAGGTGG CGCGCTGAAC GGGTTTGACG CCTCCACCGC CGAAGCGATG CAGGGCGTTG AGCGCGTTCT GGGGATCCCC GGCGGTGTTG CCGTGATCGC GACCAATACC TGGTACGCGA TGCAAGCCCT CGACGCGATT GACTACGATT GGGGTCCTGC CCCCTATGCG CCCGAACAAG CTGACCATTG GGCCGCGCTG GAGCAAGCGT TCACGGAGGA AAACCTCGAT AGCGAATGGC TGAACATCGG GGATGTAGAG GCCGATATCC CAAGTGCGAC GACGATTGAG GCCGAGTATC GCGCGCCCTA CGTGGCCCAT CAGCCACTGG AGCCGCTGAA CGCGGTTGTA TTGGTCGAAG ATGATGGCGC GCAGGTCTGG ACCGGACATC AGATGCCGCG CTTTCTACAG CAGCAAGTGG CCGCAATTAC CGGCCATGAT GCCGATCAGA TCACGCTTCA CAATCAATAT GCGGGCGGCT CTTTCGGGCA CCGTCTGGAG TTTGACTACG TCAAGCAAGC GGTTCAGATC GCAATGCAGA TGCGCGGGCG GCCAGTTAAG CTGACCTATA GCCGCGAGAC GGATTTTGCC CAAGATTTTC CAAGGCAGAT CACGATGGGG CGTGGGCGCG GCGCGGTTCT GGACGGTCAA GTTGTCAGCT TTGATACGCA GATCGCGGCG CCCTCTGTTG TTCGGTCTCA AGTGGGTCGA ATGGGGCAAT CCGTGCCCGG ACCAGACAGT CAGCTAGCCG CCGGTGTCTG GCAACAGCCC TACGGGGTTG AGAACACGCG CATGCGCGCC TATGCGGTGG AAGGGCTGTC GCCCGTCTCG TCCTGGCGCT CGGTCGGGGC ATCGGCCAAT GGGTTCATTG GTGAGGGTTT TCTGGATGAG CTGATCCATG CCGCTGGCGC GGACCCGTTG GAGGAACGTA TTCGCCTGTG TACCCTGGAT GACATCAGCC GCCAGGTCCT GGAGGCCGTC GGAGAGATGT CAAACTGGGG GGAGGCTTTG 
CCGGAGGGCA CCGGACGTGG GGTCGCGCTG GTTCATGCCT TTGGCGTTCC CTGCGCAGAG GTTGTGGAAG TCACGATGAC CGACGCGGGC ATTCGTTTGA ACACGGTCTG GGTCGCCGCC GATGTCGGTC GTGTCGTCGA TCCGGTGAAC TTCGACAATA TCGTCAAAGG CGGCGTGATC TGGGGCCTCG GCCACGCGAT CAATTCGGAG ATCACCTACA CCGATGGCAT CGCAGATCAG ACGAATTTCC ATGCCCATGA AGGGATGCGG ATGCATCAAA CGCCCGAGAT CTTTGTGCGC GGGTTGGAGA ATGGCCATGT GCGCGGGATC GGAGAGCCTC CGGTGCCACC TGCTCCTCCG GCGCTGGCCA ACGCGATCTT CGCGGCCACC GGTCAGCGCA TCCGAGAGAT GCCGTTCTGG AACCACATTG ATTTCGTCTG A
|
Protein sequence | MGKLKTFTRR AFLVGSAAVA GGVAFGTYLV ARDPDNPNLQ GLAEGAASFN PWVRIDATGI TLVTPHADIG QGAASMQAAL IAEEMDLDWA DFEIDFGRPA PAYWNTAMAG ESAPVLPWDD SFGAQAMRST VGSLLKVIGM QGTGGSTSTP DSYVKLREAG AMARETLKLA ASQRTGISFT EMRTEGAAVH LPDGTAIPYV DLATEAAGLE PVQGTPLRDP STWRLVGKDM PRIDIVPKST GAPIYGLDIE LPDMARASLR MNPRKGGALN GFDASTAEAM QGVERVLGIP GGVAVIATNT WYAMQALDAI DYDWGPAPYA PEQADHWAAL EQAFTEENLD SEWLNIGDVE ADIPSATTIE AEYRAPYVAH QPLEPLNAVV LVEDDGAQVW TGHQMPRFLQ QQVAAITGHD ADQITLHNQY AGGSFGHRLE FDYVKQAVQI AMQMRGRPVK LTYSRETDFA QDFPRQITMG RGRGAVLDGQ VVSFDTQIAA PSVVRSQVGR MGQSVPGPDS QLAAGVWQQP YGVENTRMRA YAVEGLSPVS SWRSVGASAN GFIGEGFLDE LIHAAGADPL EERIRLCTLD DISRQVLEAV GEMSNWGEAL PEGTGRGVAL VHAFGVPCAE VVEVTMTDAG IRLNTVWVAA DVGRVVDPVN FDNIVKGGVI WGLGHAINSE ITYTDGIADQ TNFHAHEGMR MHQTPEIFVR GLENGHVRGI GEPPVPPAPP ALANAIFAAT GQRIREMPFW NHIDFV
|
| |
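The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, translated, and checked for GC content follows. It uses plain Python rather than any particular library, the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the amino acid assignments are those of the standard code, which translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base);
# '*' marks a stop codon.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the Gene sequence field, used as a small example.
head = clean("ATGGGCAAGC TGAAAACGTT CACCCGCCGC")
print(translate(head))    # MGKLKTFTRR
print(gc_fraction(head))  # 0.6
```

The translated fragment matches the first ten residues of the Protein sequence field. Running the same functions on the full Gene sequence field would give the values to compare against the Protein Length and GC content rows.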