Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3480 |
Symbol | |
ID | 8744100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3582929 |
End bp | 3583969 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646514061 |
Product | Capsule synthesis protein, CapA |
Protein accession | YP_003405015 |
Protein GI | 284166736 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCATC GAATCGGCTT CACCGGCGAC GTCATGCTCG GTCGACTGGT CGACGACCGC CAGCGCCGGC GGTCGGTCGA CGCGGTCTGG AGGAACGTCC TCGAGCGCCT CCGGGACCTC GATACACTCG TGATCAACCT CGAGTGCGTG CTGTCGACGC GCGGGAGCGA GTGGCGGCGG ACCCACCGGC CGTTTCACTT CCGCGCGGAT CCCGACTGGG CCGTGCCGGT CCTCGAGCGC GCCGGCATCG ACGTCTGTGC CCTCGCGAAC AACCACGTGC TGGACTACGA GGCGGTGGCA CTGCGGGATA CGCTCGAGAA CCTCGACGAG GCCGGAATCG AACGCGCCGG CGCGGGAGAG ACGGTCGACG CGGCGCTCGA GCCGGCGGTC GTAACCGTCG ACGGCGACAG GGGCGACGAA ACCGACGGCC TCGAGTTAGC CGTCGTTTCG TTCACCGACA ACACGCCCGA GTACGCGGCC GACGAGGAGT CGCCCGGAAC CGCGTGGATC GAAATCGACC GCGACGACGA ACGGACGCGA ACCAGGGTTC GCGAGGCGCT CTCCCGCGCC CGCGAAACGG ATCCCGATCT GCTGGTCGCG TCGCTGCACT GGGGGCCGAA CATGGTCACC GAACCGCCCG ACTCGTTCCG GGCGTTCGGC CGCTGGTTGG TCGAGGAGGG CGTCGACCTG ATCCACGGCC ACAGCGCCCA CGTCTTCCAG GGGATCGAGG TCTACGATGG CGCACCGATC ATCTACGACG CGGGCGATTT CGTCGACGAC TACGCGGTCG ACGACGAGTT GCGCAACGAC CGCGGGTTCC TGTTCGAACT CGCGGTGACC GAGGACGGGA CGCCGACCGA ACTGCGGCTC CATCCCACGG AAATCGACGG CTGTGCCGTC CACGAGGCGA GCCCCGATGC CGCCCGGTGG GCTCGCGACC GGATGCGCGA GCTGTCGGCC CCCTTCGGGA CCGCGTTCGA CCGCGACGGC GAGACGCTGG TGCTGGCGCT CGAGTCGGAG CCTCGTACCG CACTCGAGTA G
|
Protein sequence | MSHRIGFTGD VMLGRLVDDR QRRRSVDAVW RNVLERLRDL DTLVINLECV LSTRGSEWRR THRPFHFRAD PDWAVPVLER AGIDVCALAN NHVLDYEAVA LRDTLENLDE AGIERAGAGE TVDAALEPAV VTVDGDRGDE TDGLELAVVS FTDNTPEYAA DEESPGTAWI EIDRDDERTR TRVREALSRA RETDPDLLVA SLHWGPNMVT EPPDSFRAFG RWLVEEGVDL IHGHSAHVFQ GIEVYDGAPI IYDAGDFVDD YAVDDELRND RGFLFELAVT EDGTPTELRL HPTEIDGCAV HEASPDAARW ARDRMRELSA PFGTAFDRDG ETLVLALESE PRTALE
|
| |