Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3762 |
Symbol | |
ID | 6411440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4036765 |
End bp | 4038678 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642713643 |
Product | amino acid adenylation domain protein |
Protein accession | YP_001992736 |
Protein GI | 192292131 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGCA CCGTGGTTAT CAAAGAAGGA TCTGGCAGCG GCGCGGTTGC GGCATGCACC GTTCCACAGG TTCTTGAGCA GATTGCCCAG AGTTTTCCGG AATCCATTGC CGCCATTTCC GAGCGTGGGC GGATTACCTT TGCCGAACTT GAATTTCGTT CGAACCAATT GGCACGATTG CTTGTCAAAA GAGGCGTGAA GGTTGGCGCA ACTGTCGTCC TCATGACGGG GCGCTCGATC GATACGTTGA TCGGCATGAC CGCGATTTTG AAAGCGGGCG GAGTCTATAT GCCCCTTGAT GTCGGGCTAG GTCCGGAAGC GATCAGCGGC GCCATCCAGG ATGCGCAGCC GGCGCTTGTT TTGACGGAGC ATCAACCTGC TTTGATTGAC GCGATCGAGC AGCGCCGCCT GAGTGACGAG CTGGATGCCT CGCGACTTGA GCCCGGTGAT CCGCTGGCGC TCGAGCTGAC GCCATCCTTG CCGGCCTATG TGATGTTCAC GTCGGGCTCG ACCGGGCGGC CGAAGGGCGT TGTGGTGCCG CATCGGGCCA TCGTCAGGCT GGTGGTTGAT ACGGATTTCA TGACGTTGTC GCCCGCGACC GTGATGCTGC ATGCTGCGCC ATTGGCGTTC GATGCCTCGA CGTTGGAAAT CTGGGGGCCG TTGCTGAACG GTGGGCAGAT CGTCATCGTC GAAGATGCGG TGCTGTCGGT CGATCGGATC GCCGAGACGC TAGGGCGTTT CTCGGTCAAC GCGGCCTGGC TCACAGCCGG TCTGTTCCAT CTGATGGTGG ACGAGCGGCC CGAGGCGCTG TCGGGGCTGA CGACCCTGCT GGCCGGCGGC GACGTTCTTT CGCCGGCACA CGTTCGTCGT GCGATGGCAC TGTTGCCGGA CTGCACCGTC GTCAACGGGT ATGGGCCCAC TGAGAATACG ACGTTCACGT GCTGCTATTC GATCCCGCGC ACCGGTTGGG GCGACGGACC CGTCCCGATC GGATTTCCAA TTTCGGGCAC CAGCGTCCAC ATTCTATCGG ACACGCTCGA GCCGGTTGCG GACGGCGAGG AAGGGCAGCT TTGCGCTGGC GGGATCGGGC TCGCGCTGGG GTACCTCAAT CGCCCAGAGC TGACCGCGGA GAAATTCATC GTCGATCCGT TGTCTGATGA CCCTGCGGCT CGTCTCTATC TGACCGGGGA TTACGTCCGG CGCCGGAGCG ACGGGGCCAT CGAGTTCCGT GGCCGCCGGG ACCGTCAGGT TAAGATCAAC GGCGTGCGTA TTGAGCTCGA TGGGGTGGAG CAGGCGCTCC GGCAGGATCC GGTCCTGGCC GACGCGGCGG TGGTTCTGTC CGCGGATCGA GGCGACGCAA AGCGGATTGT CGCATTCTTG AAGCCACTGC CGGGTGACGC TGCCGGTGAT CTTGAAGCCG GCGTCATCCG CAGGCTCAAA GAGCAGTTTC CCGCGCAAGC GATACCGTCA ACGATCAAGG TCGTGGATGA GTTGCCGCTC AACAAGAACG GCAAGATCGA CAGAGCGAAG CTGCTGAGCG ACCTGATCGC GACGGAGCAG CTGGGCGCAG GTACCGACGA GGAAGACTTC TCTGACGACA GTGTCGGCTC GATCGTCGCC GATATTTGGA GCGCATTGCT GGGCAAGTCG GTCGACGCCA GGGCAAACCT GTTCGATCTC GGCGCGACAT CACTCCAGAT GATCGCGGCG CACGAGCGCA TCCAGGCGGC GACCGGCCTT CGTTTTCCAG TCACGGACCT GTTCGCCCAT CCCAGCATCG CCGAGTTCCA GGCTTGTCTG GACGGAGCCT CGCATCGGTC GCTCGCCATT GCCGGAGCGG CGCGTGGCCG CCGGCAGAGG CATGCGATGA ATGCGGCGTA TGGAGCGATC AACGTTCGGA GCCATCACGC GTAG
|
Protein sequence | MASTVVIKEG SGSGAVAACT VPQVLEQIAQ SFPESIAAIS ERGRITFAEL EFRSNQLARL LVKRGVKVGA TVVLMTGRSI DTLIGMTAIL KAGGVYMPLD VGLGPEAISG AIQDAQPALV LTEHQPALID AIEQRRLSDE LDASRLEPGD PLALELTPSL PAYVMFTSGS TGRPKGVVVP HRAIVRLVVD TDFMTLSPAT VMLHAAPLAF DASTLEIWGP LLNGGQIVIV EDAVLSVDRI AETLGRFSVN AAWLTAGLFH LMVDERPEAL SGLTTLLAGG DVLSPAHVRR AMALLPDCTV VNGYGPTENT TFTCCYSIPR TGWGDGPVPI GFPISGTSVH ILSDTLEPVA DGEEGQLCAG GIGLALGYLN RPELTAEKFI VDPLSDDPAA RLYLTGDYVR RRSDGAIEFR GRRDRQVKIN GVRIELDGVE QALRQDPVLA DAAVVLSADR GDAKRIVAFL KPLPGDAAGD LEAGVIRRLK EQFPAQAIPS TIKVVDELPL NKNGKIDRAK LLSDLIATEQ LGAGTDEEDF SDDSVGSIVA DIWSALLGKS VDARANLFDL GATSLQMIAA HERIQAATGL RFPVTDLFAH PSIAEFQACL DGASHRSLAI AGAARGRRQR HAMNAAYGAI NVRSHHA
|
| |