Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1897 |
Symbol | |
ID | 3747642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2411709 |
End bp | 2416382 |
Gene Length | 4674 bp |
Protein Length | 1557 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637774434 |
Product | Nidogen, extracellular region |
Protein accession | YP_380190 |
Protein GI | 78189852 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACAT TACTGACTAA TCTTGGCGGC ACCCTTGGTT TTGGTGAATA CTATTTAACT CGCAATGACG ATAGCTATAA AAATGGCATT GAAGTAGCAT CGGTTTTTGG AGCTGATGGT CTTAACTTTT TTGGTCGCCA CTATACCTAC TTTTCGGTAA ATAATAATGG CAATATCTCC TTTGCGAATG ATGCAAATTC AGGGTTAAGT ACCTATACGC CCTTTGGCTT ACAAGAGGGT GGTTATGCAC TTATTGCTCC GTTTTTTGCT GATGTTGACA CTCGCTTTTT GAGTGATGCG GCGGCGGAAG CCAATCAAAT AACTCCCACG CCTGACGGTA CATCGCAAGG TTCTAATTTG GTTTGGTACG ATCTTGATCC TGAAGGCAAT AATGGTAAAG GCGTTCTTAC CGTTACATGG GATGATGTTG GCTATTATAG TTACGCCACC GATAAGCTCA ATGCCTTTCA GTTACAGCTT ATTGGGCAGG GCAATGGCAA TTTTGATATC GTGTTCCGCT ACGAAGCAGT GAATTGGACA ACGGGCATTG CCAGCGGTGG TTTGTATGGA TTAGGTGGTA CCGTGGCGCG TGCGGGTTAT TCCACTGGCG ATGGCTCAGC ATGGTATGAG CTACCGCAAT CGGGCAATCA AGATGCTATG CTGTCGCTCG ATACTTCCGC AGGCAATACG GGCGAGGCAG GCTCCTACCT TTTTACCGTG CGCAATAGCC AAGAGGTTGG GGTGCTTAAT GGTACGGAGG GCGATGATTT GCTTGCTGGT TCCACCATGC ACGATACCAT TTATGGCTTT GCAGGCAACG ACTATCTTAT TGGCAACAGT GGCGACGATT TGCTTGTAGG CGGTGCAGGC GATGATACTT ATACGGTTGA CGAAGGGGAT ATTATTACAG AGGAAGTTGA TGGTGGCTTC GACACCATTT TTGCCGCAAC CACCTATACG CTACCAAACA ATGTTGAGGT GCTGCGCTTA ACAGGTGCAG CTTCTGTTAA CGCTATCGGC AACAATGGCG ACAATATTTT TGTAGGCAAT ATCGGCAATA ACCTTTTTGA TGGAGGCGAT GGATTTGATA CGGTTGATTA TTCTCGTAGC CGTAGTGAAA TAACGGTTAA TCTTACCCAA ACAACGCCTT ACTCTATCGG TGGATATGAA GGCAGTGATA CGTTTTTACG TATTGAAAAT CTTTACGGCT CAACTTATAG CGATTATCTT CAAGGCAATA ATAGTGAGAA TATTTTGCGG GGTAATGCGG GGGCTGATAC CTTGCAAGGT AATGGAGGAA ATGATACTCT TGATGGCGGC GATGGTGTAG ATACAATATG GCTCGCTAAT AGTTTTAGTG AATATACTAT AACCTATAAT GCAGCATCAG GTTTTTTGCA ATCGGTGCAT AATAGTACAA TTGAAAGTTC GGATGGTATT GATAGTTTAC GATATGTTGA ATATCTTCGT TTCTCGGATG TACTCTATTG TGTAAAAATT GTAGAAAATG CCCTTGAGCT AACGCGTGAA AATATAGCAC CAAGCTTCAA GACTATTTTA TCTACCGTTG CCTCTACGAA CGAAGATACA CTTGTTGCTA TTACTTTTCA GGATATACTT GTAACAACTG AGTGTGTGGA TATTGATGGC TCAATTACAT CTTTTAATAT TTCAGATTTA CACAGTGGTT CGCTTTGGAT TGGTGCTGAT AGCAACAGTG CGCTGCCTTA CAATTATTAT AGTACAAGTC TGATTGATGC AAACAATAAT GCTTATTGGA AGCCCGATCA AGATGCCAAC GGCTTGCTTG GCGCCTTTAC TGTCGTTGCA TTTGATAACG AAGGTGCAAT AACGGAATCT TCCCATACGC TCAATGTTGA TGTGCTTCCG CAATCTGATG CACCAAGCAT ATCCATTCCC CATCCACTTT TTGATAACCC TCTCACCTAT TCAACCCAAG ATTATCCCAC CGATATAGCG CTTGGCGATC TCAATGGAGA TATGTTAAAG GATATGGTTG TTGTTAATGA AGAGAGCAAC TCTCTTTCTG TATATATCAA TCAAGGTGAT GCTATTTTTG CTCCACAAGA ACTTTATTTC GTTGGTAATA GCACTCGCGG TATTTCATTT ATAGATATCA ATAGCGATGA GGCTTTGGAT ATAGCACTGG CGATTAACGG TGATTGGAAT TCTTATATTT TACTGTTACT TAATAATGGT AGCGGTGAGT TTCATGCTCT ACCAACAACA TTGCCTACGG GTTATTATGC GACCTCTGTT GCAAGTGGCG ATTTTAATGG CGATCAGTTG ATGGATGTAG TGGTTGCTAA CTTTGGGACG TATAGCGTAT CAGTTTTTAT TAATAATGGA AATAATACCT TTACCGCACA AGAGCCGTAT CTCTTAGACG ATAGTCCTTA TGATCTTGTT GCCGTTGATT ATAATGAGGA TGGATCCTTA GATTTGCTTG CTGCAAGCAA CTATGGGAAC AATATTTCCG TTTTAAAAGG TAACGGTAAT GGAGGCTTTA CTGACTGCAA AAACTATCAA GTAGGCGATA ATCCAAAGGC ATTAGCAACT GCCGATTTTA ACGGCGATGG TAAGAGCGAT ATTGCAGTTG CCAATTCAGG CAATAATAGT GTGTCGCTGT TATTGCAGAA TGAGGTTGGT GAATTTATTG CTCCTGCAAC CTATGAGGTT GGCAATAATC CGCAAGCTAT AACAGTAGCC GATCTTAATG GCGATGGTTT CCTTGATTTA CTTACCGCAA ATTATAATAG TAATGCCGTT TCTGTGCTGT TAAATAATGG CGATGCCACC TTTATAGCAC AAGATGATTA TAGTGTTGGC TATGCGCCTA TAGCGTTAGC AAGTAGTGAT GTAAATAGCG ATGGATATGC CGACATAGCG GTTGTTAATT ATCAAGAAAA CTCGGTTTCG GTGCTAACCA ATATATCATT CCTTACAACG TTTTATAGTG GTACTGTGCC GGTTATTGTT AGCCCAACTA TCACAATAGA TGATGCGGAT AATCGGTATG GATGGAATAA TGCAACATTA GGCATTCAAA TTTCAACTCA TGCCGATAGC GACGATATGC TGCATTTACC GCTTACCAAT GCGGGGGATG GATCCATTTG GCTTCATGTG AGTGATGCGG GATATGCGCT TATGGCAGGT GAATTGCAGA TAGCGTCAGC CAATTCAGCA AAGGCTGAGG GCGATGCGGC TTGGATATTT ACCTTTAATG GTGAGGCATC AAGCGAGATG GTGCAAGCCG TTGGGCAAGC AATACTTTTT AGCAATAGCA ACACGCTTGA TTCATTTGCC GAGCGCACGG TTACTTATAC CGTAACCGAT GCCGATGGGC TTTCTTGTAG CGCCTCTCAA ACGGTGAGCG TCATGGTGGA TGAGCTACCT CCAACCTTGC TCTCAGCCTA TCCCACTCAT AACGCCGAAG CTGTTTCAGT TACCGAAACC CTCTTCTTTA CTTTTAGTGA ACCTATTACG CTAAACAGTG GCGCTATTAC GCTCCATGCA GCTTCACCAA CAGGTATCGC GCTTGCTGCC GATATTTCCA TTGTTGGTAA TTCATTGCAA CTTGATCCCC TTATGCCGTT ACAGCCTGAT GTGGAATATT TTGTAACGTT TGAGGAAAAG AGCATTACTG ATCGTGCAGG CAATGCTTTT GTGGAGAGTA TTTCCTATAG CTTTACGACC GAAAGTGCCC CGATTCTCCG CCATGCACTC TCTGGAGTGG TACAATTTTG GAACAGTAAT GATGCCATCA CCGATGTTGC CACCACGCTC ATGACGCTCC CTTATGAGCA TGGCTCCCAA CTTGTGGAGT TCCATAACTT ACAACGTGAT GATGAGAGCT TTAGCGTTGA GGTTTGGATA ACGCCTACGG AAGCACTCCA TAGTGTGCAG CTTCACTTTT TGCTCCCCAC CACCAATGCA ACGCCATCGT GGCAAGAGGC TACAGGGTTG CCGTCAGGAT GGCATATGGC TTATAACTCC AATAATAGTG GAAGTTTTGC GCTGAGTAGT ATGGGTGATG CCGTTATTGA AGCAGGATCG GTAAAATTGG GGCAGCTTAC CTTTGCGGCA CCAAGTGGCA TAGAGCGTGC GGAGTTATTG CTTACGTCGG GGGAATTTGG TTGTGATGGT GGCGAGGATT TGTTAGAGAC GGTGGATATT GCGCCAACAG GCATTGCTTT TAGTTGTGTT GACACGGCAA ACGATGGCAG CTATCATCAT TTTGATATGC TGGAAGGGAG CTACGCCTTG CAAGCCGCAA AAGAGGCTGT ACCAGAGGGG CAAAGTGCTG TAACGGCAGC CGATGCCTAT GCGGCGCTTC AAATGGCAGC AAACATCAAT CCAAACGGCA ATGAGAGCGA GGTTTTGCAA TGGCAATATC TTGCAGCCGA TGTGAACCAT GATGGAAAAG TGAGGGCAGC CGATGCGCTC AATATTTTAA AAATGGCAGT CAATTATGAG GGCGCTCCTG AAGAGGCATG GATTTTTATG CCCGACTATG TTGCTGGCTA TGAAATGACG CGCAGTACAG TTGATTGGTC AGCCGAAGAG ATAGTGGTTG ATTTGTTGAG CAATGAAGCT ATGAGTCTTA TTGGTATTGT GCGTGGCGAT GTAGATGGAA GTTGGATGGG GTGA
|
Protein sequence | MSTLLTNLGG TLGFGEYYLT RNDDSYKNGI EVASVFGADG LNFFGRHYTY FSVNNNGNIS FANDANSGLS TYTPFGLQEG GYALIAPFFA DVDTRFLSDA AAEANQITPT PDGTSQGSNL VWYDLDPEGN NGKGVLTVTW DDVGYYSYAT DKLNAFQLQL IGQGNGNFDI VFRYEAVNWT TGIASGGLYG LGGTVARAGY STGDGSAWYE LPQSGNQDAM LSLDTSAGNT GEAGSYLFTV RNSQEVGVLN GTEGDDLLAG STMHDTIYGF AGNDYLIGNS GDDLLVGGAG DDTYTVDEGD IITEEVDGGF DTIFAATTYT LPNNVEVLRL TGAASVNAIG NNGDNIFVGN IGNNLFDGGD GFDTVDYSRS RSEITVNLTQ TTPYSIGGYE GSDTFLRIEN LYGSTYSDYL QGNNSENILR GNAGADTLQG NGGNDTLDGG DGVDTIWLAN SFSEYTITYN AASGFLQSVH NSTIESSDGI DSLRYVEYLR FSDVLYCVKI VENALELTRE NIAPSFKTIL STVASTNEDT LVAITFQDIL VTTECVDIDG SITSFNISDL HSGSLWIGAD SNSALPYNYY STSLIDANNN AYWKPDQDAN GLLGAFTVVA FDNEGAITES SHTLNVDVLP QSDAPSISIP HPLFDNPLTY STQDYPTDIA LGDLNGDMLK DMVVVNEESN SLSVYINQGD AIFAPQELYF VGNSTRGISF IDINSDEALD IALAINGDWN SYILLLLNNG SGEFHALPTT LPTGYYATSV ASGDFNGDQL MDVVVANFGT YSVSVFINNG NNTFTAQEPY LLDDSPYDLV AVDYNEDGSL DLLAASNYGN NISVLKGNGN GGFTDCKNYQ VGDNPKALAT ADFNGDGKSD IAVANSGNNS VSLLLQNEVG EFIAPATYEV GNNPQAITVA DLNGDGFLDL LTANYNSNAV SVLLNNGDAT FIAQDDYSVG YAPIALASSD VNSDGYADIA VVNYQENSVS VLTNISFLTT FYSGTVPVIV SPTITIDDAD NRYGWNNATL GIQISTHADS DDMLHLPLTN AGDGSIWLHV SDAGYALMAG ELQIASANSA KAEGDAAWIF TFNGEASSEM VQAVGQAILF SNSNTLDSFA ERTVTYTVTD ADGLSCSASQ TVSVMVDELP PTLLSAYPTH NAEAVSVTET LFFTFSEPIT LNSGAITLHA ASPTGIALAA DISIVGNSLQ LDPLMPLQPD VEYFVTFEEK SITDRAGNAF VESISYSFTT ESAPILRHAL SGVVQFWNSN DAITDVATTL MTLPYEHGSQ LVEFHNLQRD DESFSVEVWI TPTEALHSVQ LHFLLPTTNA TPSWQEATGL PSGWHMAYNS NNSGSFALSS MGDAVIEAGS VKLGQLTFAA PSGIERAELL LTSGEFGCDG GEDLLETVDI APTGIAFSCV DTANDGSYHH FDMLEGSYAL QAAKEAVPEG QSAVTAADAY AALQMAANIN PNGNESEVLQ WQYLAADVNH DGKVRAADAL NILKMAVNYE GAPEEAWIFM PDYVAGYEMT RSTVDWSAEE IVVDLLSNEA MSLIGIVRGD VDGSWMG
|
| |