Gene Cag_1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1897 
Symbol 
ID3747642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2411709 
End bp2416382 
Gene Length4674 bp 
Protein Length1557 aa 
Translation table11 
GC content44% 
IMG OID637774434 
ProductNidogen, extracellular region 
Protein accessionYP_380190 
Protein GI78189852 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACAT TACTGACTAA TCTTGGCGGC ACCCTTGGTT TTGGTGAATA CTATTTAACT 
CGCAATGACG ATAGCTATAA AAATGGCATT GAAGTAGCAT CGGTTTTTGG AGCTGATGGT
CTTAACTTTT TTGGTCGCCA CTATACCTAC TTTTCGGTAA ATAATAATGG CAATATCTCC
TTTGCGAATG ATGCAAATTC AGGGTTAAGT ACCTATACGC CCTTTGGCTT ACAAGAGGGT
GGTTATGCAC TTATTGCTCC GTTTTTTGCT GATGTTGACA CTCGCTTTTT GAGTGATGCG
GCGGCGGAAG CCAATCAAAT AACTCCCACG CCTGACGGTA CATCGCAAGG TTCTAATTTG
GTTTGGTACG ATCTTGATCC TGAAGGCAAT AATGGTAAAG GCGTTCTTAC CGTTACATGG
GATGATGTTG GCTATTATAG TTACGCCACC GATAAGCTCA ATGCCTTTCA GTTACAGCTT
ATTGGGCAGG GCAATGGCAA TTTTGATATC GTGTTCCGCT ACGAAGCAGT GAATTGGACA
ACGGGCATTG CCAGCGGTGG TTTGTATGGA TTAGGTGGTA CCGTGGCGCG TGCGGGTTAT
TCCACTGGCG ATGGCTCAGC ATGGTATGAG CTACCGCAAT CGGGCAATCA AGATGCTATG
CTGTCGCTCG ATACTTCCGC AGGCAATACG GGCGAGGCAG GCTCCTACCT TTTTACCGTG
CGCAATAGCC AAGAGGTTGG GGTGCTTAAT GGTACGGAGG GCGATGATTT GCTTGCTGGT
TCCACCATGC ACGATACCAT TTATGGCTTT GCAGGCAACG ACTATCTTAT TGGCAACAGT
GGCGACGATT TGCTTGTAGG CGGTGCAGGC GATGATACTT ATACGGTTGA CGAAGGGGAT
ATTATTACAG AGGAAGTTGA TGGTGGCTTC GACACCATTT TTGCCGCAAC CACCTATACG
CTACCAAACA ATGTTGAGGT GCTGCGCTTA ACAGGTGCAG CTTCTGTTAA CGCTATCGGC
AACAATGGCG ACAATATTTT TGTAGGCAAT ATCGGCAATA ACCTTTTTGA TGGAGGCGAT
GGATTTGATA CGGTTGATTA TTCTCGTAGC CGTAGTGAAA TAACGGTTAA TCTTACCCAA
ACAACGCCTT ACTCTATCGG TGGATATGAA GGCAGTGATA CGTTTTTACG TATTGAAAAT
CTTTACGGCT CAACTTATAG CGATTATCTT CAAGGCAATA ATAGTGAGAA TATTTTGCGG
GGTAATGCGG GGGCTGATAC CTTGCAAGGT AATGGAGGAA ATGATACTCT TGATGGCGGC
GATGGTGTAG ATACAATATG GCTCGCTAAT AGTTTTAGTG AATATACTAT AACCTATAAT
GCAGCATCAG GTTTTTTGCA ATCGGTGCAT AATAGTACAA TTGAAAGTTC GGATGGTATT
GATAGTTTAC GATATGTTGA ATATCTTCGT TTCTCGGATG TACTCTATTG TGTAAAAATT
GTAGAAAATG CCCTTGAGCT AACGCGTGAA AATATAGCAC CAAGCTTCAA GACTATTTTA
TCTACCGTTG CCTCTACGAA CGAAGATACA CTTGTTGCTA TTACTTTTCA GGATATACTT
GTAACAACTG AGTGTGTGGA TATTGATGGC TCAATTACAT CTTTTAATAT TTCAGATTTA
CACAGTGGTT CGCTTTGGAT TGGTGCTGAT AGCAACAGTG CGCTGCCTTA CAATTATTAT
AGTACAAGTC TGATTGATGC AAACAATAAT GCTTATTGGA AGCCCGATCA AGATGCCAAC
GGCTTGCTTG GCGCCTTTAC TGTCGTTGCA TTTGATAACG AAGGTGCAAT AACGGAATCT
TCCCATACGC TCAATGTTGA TGTGCTTCCG CAATCTGATG CACCAAGCAT ATCCATTCCC
CATCCACTTT TTGATAACCC TCTCACCTAT TCAACCCAAG ATTATCCCAC CGATATAGCG
CTTGGCGATC TCAATGGAGA TATGTTAAAG GATATGGTTG TTGTTAATGA AGAGAGCAAC
TCTCTTTCTG TATATATCAA TCAAGGTGAT GCTATTTTTG CTCCACAAGA ACTTTATTTC
GTTGGTAATA GCACTCGCGG TATTTCATTT ATAGATATCA ATAGCGATGA GGCTTTGGAT
ATAGCACTGG CGATTAACGG TGATTGGAAT TCTTATATTT TACTGTTACT TAATAATGGT
AGCGGTGAGT TTCATGCTCT ACCAACAACA TTGCCTACGG GTTATTATGC GACCTCTGTT
GCAAGTGGCG ATTTTAATGG CGATCAGTTG ATGGATGTAG TGGTTGCTAA CTTTGGGACG
TATAGCGTAT CAGTTTTTAT TAATAATGGA AATAATACCT TTACCGCACA AGAGCCGTAT
CTCTTAGACG ATAGTCCTTA TGATCTTGTT GCCGTTGATT ATAATGAGGA TGGATCCTTA
GATTTGCTTG CTGCAAGCAA CTATGGGAAC AATATTTCCG TTTTAAAAGG TAACGGTAAT
GGAGGCTTTA CTGACTGCAA AAACTATCAA GTAGGCGATA ATCCAAAGGC ATTAGCAACT
GCCGATTTTA ACGGCGATGG TAAGAGCGAT ATTGCAGTTG CCAATTCAGG CAATAATAGT
GTGTCGCTGT TATTGCAGAA TGAGGTTGGT GAATTTATTG CTCCTGCAAC CTATGAGGTT
GGCAATAATC CGCAAGCTAT AACAGTAGCC GATCTTAATG GCGATGGTTT CCTTGATTTA
CTTACCGCAA ATTATAATAG TAATGCCGTT TCTGTGCTGT TAAATAATGG CGATGCCACC
TTTATAGCAC AAGATGATTA TAGTGTTGGC TATGCGCCTA TAGCGTTAGC AAGTAGTGAT
GTAAATAGCG ATGGATATGC CGACATAGCG GTTGTTAATT ATCAAGAAAA CTCGGTTTCG
GTGCTAACCA ATATATCATT CCTTACAACG TTTTATAGTG GTACTGTGCC GGTTATTGTT
AGCCCAACTA TCACAATAGA TGATGCGGAT AATCGGTATG GATGGAATAA TGCAACATTA
GGCATTCAAA TTTCAACTCA TGCCGATAGC GACGATATGC TGCATTTACC GCTTACCAAT
GCGGGGGATG GATCCATTTG GCTTCATGTG AGTGATGCGG GATATGCGCT TATGGCAGGT
GAATTGCAGA TAGCGTCAGC CAATTCAGCA AAGGCTGAGG GCGATGCGGC TTGGATATTT
ACCTTTAATG GTGAGGCATC AAGCGAGATG GTGCAAGCCG TTGGGCAAGC AATACTTTTT
AGCAATAGCA ACACGCTTGA TTCATTTGCC GAGCGCACGG TTACTTATAC CGTAACCGAT
GCCGATGGGC TTTCTTGTAG CGCCTCTCAA ACGGTGAGCG TCATGGTGGA TGAGCTACCT
CCAACCTTGC TCTCAGCCTA TCCCACTCAT AACGCCGAAG CTGTTTCAGT TACCGAAACC
CTCTTCTTTA CTTTTAGTGA ACCTATTACG CTAAACAGTG GCGCTATTAC GCTCCATGCA
GCTTCACCAA CAGGTATCGC GCTTGCTGCC GATATTTCCA TTGTTGGTAA TTCATTGCAA
CTTGATCCCC TTATGCCGTT ACAGCCTGAT GTGGAATATT TTGTAACGTT TGAGGAAAAG
AGCATTACTG ATCGTGCAGG CAATGCTTTT GTGGAGAGTA TTTCCTATAG CTTTACGACC
GAAAGTGCCC CGATTCTCCG CCATGCACTC TCTGGAGTGG TACAATTTTG GAACAGTAAT
GATGCCATCA CCGATGTTGC CACCACGCTC ATGACGCTCC CTTATGAGCA TGGCTCCCAA
CTTGTGGAGT TCCATAACTT ACAACGTGAT GATGAGAGCT TTAGCGTTGA GGTTTGGATA
ACGCCTACGG AAGCACTCCA TAGTGTGCAG CTTCACTTTT TGCTCCCCAC CACCAATGCA
ACGCCATCGT GGCAAGAGGC TACAGGGTTG CCGTCAGGAT GGCATATGGC TTATAACTCC
AATAATAGTG GAAGTTTTGC GCTGAGTAGT ATGGGTGATG CCGTTATTGA AGCAGGATCG
GTAAAATTGG GGCAGCTTAC CTTTGCGGCA CCAAGTGGCA TAGAGCGTGC GGAGTTATTG
CTTACGTCGG GGGAATTTGG TTGTGATGGT GGCGAGGATT TGTTAGAGAC GGTGGATATT
GCGCCAACAG GCATTGCTTT TAGTTGTGTT GACACGGCAA ACGATGGCAG CTATCATCAT
TTTGATATGC TGGAAGGGAG CTACGCCTTG CAAGCCGCAA AAGAGGCTGT ACCAGAGGGG
CAAAGTGCTG TAACGGCAGC CGATGCCTAT GCGGCGCTTC AAATGGCAGC AAACATCAAT
CCAAACGGCA ATGAGAGCGA GGTTTTGCAA TGGCAATATC TTGCAGCCGA TGTGAACCAT
GATGGAAAAG TGAGGGCAGC CGATGCGCTC AATATTTTAA AAATGGCAGT CAATTATGAG
GGCGCTCCTG AAGAGGCATG GATTTTTATG CCCGACTATG TTGCTGGCTA TGAAATGACG
CGCAGTACAG TTGATTGGTC AGCCGAAGAG ATAGTGGTTG ATTTGTTGAG CAATGAAGCT
ATGAGTCTTA TTGGTATTGT GCGTGGCGAT GTAGATGGAA GTTGGATGGG GTGA
 
Protein sequence
MSTLLTNLGG TLGFGEYYLT RNDDSYKNGI EVASVFGADG LNFFGRHYTY FSVNNNGNIS 
FANDANSGLS TYTPFGLQEG GYALIAPFFA DVDTRFLSDA AAEANQITPT PDGTSQGSNL
VWYDLDPEGN NGKGVLTVTW DDVGYYSYAT DKLNAFQLQL IGQGNGNFDI VFRYEAVNWT
TGIASGGLYG LGGTVARAGY STGDGSAWYE LPQSGNQDAM LSLDTSAGNT GEAGSYLFTV
RNSQEVGVLN GTEGDDLLAG STMHDTIYGF AGNDYLIGNS GDDLLVGGAG DDTYTVDEGD
IITEEVDGGF DTIFAATTYT LPNNVEVLRL TGAASVNAIG NNGDNIFVGN IGNNLFDGGD
GFDTVDYSRS RSEITVNLTQ TTPYSIGGYE GSDTFLRIEN LYGSTYSDYL QGNNSENILR
GNAGADTLQG NGGNDTLDGG DGVDTIWLAN SFSEYTITYN AASGFLQSVH NSTIESSDGI
DSLRYVEYLR FSDVLYCVKI VENALELTRE NIAPSFKTIL STVASTNEDT LVAITFQDIL
VTTECVDIDG SITSFNISDL HSGSLWIGAD SNSALPYNYY STSLIDANNN AYWKPDQDAN
GLLGAFTVVA FDNEGAITES SHTLNVDVLP QSDAPSISIP HPLFDNPLTY STQDYPTDIA
LGDLNGDMLK DMVVVNEESN SLSVYINQGD AIFAPQELYF VGNSTRGISF IDINSDEALD
IALAINGDWN SYILLLLNNG SGEFHALPTT LPTGYYATSV ASGDFNGDQL MDVVVANFGT
YSVSVFINNG NNTFTAQEPY LLDDSPYDLV AVDYNEDGSL DLLAASNYGN NISVLKGNGN
GGFTDCKNYQ VGDNPKALAT ADFNGDGKSD IAVANSGNNS VSLLLQNEVG EFIAPATYEV
GNNPQAITVA DLNGDGFLDL LTANYNSNAV SVLLNNGDAT FIAQDDYSVG YAPIALASSD
VNSDGYADIA VVNYQENSVS VLTNISFLTT FYSGTVPVIV SPTITIDDAD NRYGWNNATL
GIQISTHADS DDMLHLPLTN AGDGSIWLHV SDAGYALMAG ELQIASANSA KAEGDAAWIF
TFNGEASSEM VQAVGQAILF SNSNTLDSFA ERTVTYTVTD ADGLSCSASQ TVSVMVDELP
PTLLSAYPTH NAEAVSVTET LFFTFSEPIT LNSGAITLHA ASPTGIALAA DISIVGNSLQ
LDPLMPLQPD VEYFVTFEEK SITDRAGNAF VESISYSFTT ESAPILRHAL SGVVQFWNSN
DAITDVATTL MTLPYEHGSQ LVEFHNLQRD DESFSVEVWI TPTEALHSVQ LHFLLPTTNA
TPSWQEATGL PSGWHMAYNS NNSGSFALSS MGDAVIEAGS VKLGQLTFAA PSGIERAELL
LTSGEFGCDG GEDLLETVDI APTGIAFSCV DTANDGSYHH FDMLEGSYAL QAAKEAVPEG
QSAVTAADAY AALQMAANIN PNGNESEVLQ WQYLAADVNH DGKVRAADAL NILKMAVNYE
GAPEEAWIFM PDYVAGYEMT RSTVDWSAEE IVVDLLSNEA MSLIGIVRGD VDGSWMG