Gene Francci3_4065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4065 
Symbol 
ID3907026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4864790 
End bp4868008 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content69% 
IMG OID637881394 
Productglycine dehydrogenase 
Protein accessionYP_483144 
Protein GI86742744 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGACG TCACCCAGTA CGACGACACC CCCTATGCCG ATGGACCGGC CACGGCTGCC 
GATGGACCGG CCACGACCGC CACGACCGCC ACCATCAGTC CCGCGTCGTC GCACGCGCGG
TCGGGGGCGC CGCGACAGGG CAGTGCCGCG GCGGTGGGCA GGAACGGTGC GAGGCGGCTG
CCCGCCGCGG CCATCCCGCG GTTCGCCGAT CGGCACATCG GCCCGGACCC GTCATCCCAA
CGGGAGATGC TGGATGCGCT GCGGGTGGAA TCACTGGCCG CGCTCACCGA CGCGGCCGTC
CCGGCAAGCA TCCGTGACCA TGATCTGGAT CTTCCCGCGG CGCTGAGCGA GCCCGCGGTG
CTCGCGGCCC TACGCGCGTT CGGTAGCAGG AACCGACCGG TTTCCTCGAT GATCGGGCTG
GGATTCCATC CCGCGGTGAT GCCGGGAGTC ATCCAGCGCA ACGTCCTGGA GAACCCCGCG
TGGTATACGG CGTACACGCC GTACCAGCCG GAGATCTCCC AGGGCCGCCT GGAAGCGTTG
CTCAACTTCC AGACGATGAT CACTGATTTG ACCGGCCTTG CGGTGGCCGG AGCCTCCCTG
CTGGACGAGC CGACCGCGGC GGCCGAGGCG ATGCAGATCG CGTTTCGGAC GGCGAAGGGT
TCCCGCGCGA CGTTCCTCAT CGATGCCGAC ACCCTCCCGC AGACCGTCTC GGTGGTCGCG
ACCAGGGCGG AGGCGCTGGG GATCAATGTG GTGGTCGCGG ATCTCGCAGC CGGACTCACG
GCCGTGGGCC CGGCGCAGCT GGATGCGGCC TTCGGCCTGC TGCTGTCCTA TCCCGGGCCG
GCGGGCGTCC TGCGGGATGT GCGTGCTGTG ATCGCATCGG CGAGGGAGCG CGGCATCGTC
GTCACGATCG CCGCCGATCC GCTGGCGCTG ACCCTGCTGC GGGCGCCTGG CGACCTCGGC
GCGGACATCG CGGTGGGGAG CACGCAGCGG TTCGGACTAC CGCTGTCCTT CGGAGGCCCG
CACGCCGGCT ACCTCGCAGT GCGCAAGGGG CTGGAACGGT CCCTGCCGGG GCGGCTCGTC
GGGGTGTCGG TCGATGCGGA CGGCGCACCC GCGTACCGGC TCACCCTGCA GACCCGTGAG
CAGCACATCC GACGGGAGAA GGCGACGAGC AACATCTGCA CGGCCCAGGT GCTGCCCGCG
GTACTCGCCT CCATGTACGC GGTCTACCAC GGCCCGGAGG GCCTGGCCGG GATCGCGCGT
CGCATCCACG GGCACGCGGT GCGCCTCGCC GAAGGCCTGC GCGCCGCCGG TGTGACGGTG
GTGCACGACG CGTTCTTCGA CACCGTCCTC GCCGCGGTGC CGGGCCGGGC CACCCAGGTG
GTTGCGGACG CGTTGGCGCG GGGCGTGAAC CTGCGGCTGG TCGACGACGA CCACGTGGGT
ATCTCCTGCA ACGAGACGAC GGGCCAGGCC GAGCTGGAGG CCGTCCGGTC GGCCTTCGGG
GTGGGCCCGG AGGCGACGGC GGGCATCACT TGGACCGGGA CCGGGACCGG GACCGAGATC
CGGACCGGGG CCTGGACCGG GTCCGTCGCG GCGGCGAACG ATCATCTCGT GGCCCGCGCG
GACCAGGCGG ACCAGGCGGA CCAGGCGGAC CAAGCGGCGG ACGAGCCGGT GGCGCTGCCC
GCCGAACTGG TGCGTACGGA TCCCTACCTG CAGCATCCGG TGTTCCACGA CCATCGGTCG
GAGACCGCGA TGCTCCGCTA TCTGCGTCGA CTGTCCGATC TTGATCTGGC TCTTGACCGG
GGCATGATCC CCCTGGGGTC GTGCACGATG AAACTCAACG CGACCACTGA GATGGCCGCG
GTCACCTGGC CGGAATTCGC TGACATCCAT CCATTTGCCC CGTTGGACCA GGCCGCCGGA
TACCTCGCGA TGATTCAGGA CCTGGAGCGT TGGCTCGCGC AGATCACCGG ATATGCGGGG
GTCTCCCTCC AGCCGAATGC CGGCAGCCAG GGTGAGCTCG CCGGGCTGCT CGCCATCCGG
GCCTATCACC GCGATCATGC TGTTCCCGGA TCCGTGGTGC GAAACATCTG TCTCATTCCC
TCCTCGGCGC ACGGGACGAA TGCGGCGAGC GCCGCGATGG CGGGAATGCG GGTGGTCGTC
GTCTCCTGTG ACGACGACGG AAACGTCGAC CTGAACGATC TGGCCCGCAA GGCCCGCGCG
AACGCGGACG CCTTGGCCGC GCTGATGGTC ACCTACCCGT CGACCCATGG TGTGTACGAG
GAGGGCATCG GGCAGGCGTG CGCGATCGTG CATGAGGCCG GCGGTCTGGT GTACGTCGAC
GGAGCGAATC TCAACGCTCT GGTGGGGCTC GCCAGGCCGG GGCAGTTCGG GGCCGACGTG
AGCCACCTGA ACCTGCACAA GACGTTCTGC ATCCCGCATG GGGGCGGGGG TCCGGGGGTC
GGCCCGGTGG CCGTGGTCGA GAAACTCCTG CCCTATCTGC CGAACCACCC GCTGCGGCCG
GAGGCAGGAC CGGCCACCGG AGTGGGCCCG ATCTCGGGAT CCCCGTGGGG CTCGGCCGGA
ATTCTTATGA TTCCGTGGGC CTACATTCGG ATGATGGGGG CGGACGGCCT GCGCCGGGCA
ACCTCGGTGG CCGTCCTGAA CGCCAACTAC ATTGCCCACC GGCTGCATCC GTACTATCCG
GTGCTCTACG CGGGCCGGGA CGGGCTGGTC GCCCATGAGT GCATTCTGGA CCTACGGCCG
TTGACGAAGC TGACCGGCGT CACCGTGGAC GACGTGGCGA AGCGTCTCAT CGACTACGGT
TTCCATGCTC CGACCATGTC ATTCCCGGTT GCCGGAACAC TGATGGTCGA GCCGACGGAG
AGTGAGGATC TCGGCGAGAT CGATCGTTTC TGCGACGCAA TGATCTCCAT TCGGGCCGAG
GCGGACAAGG TCGGAGACGG TATCTGGCCG CGGACCGACA ATCCCCTGCA TAATGCTCCG
CATACTGCAC AGATGGTTAC CGCCAACGAA TGGTCACACG CTTATCCGCG ATCGGTGGCG
GCCTATCCCG TCGCTTCGCT GCGGGCCGCC AAGTACTGGC CCCCGGTACG TCGGATCGAC
GGCGCCTATG GTGATCGCAA CCTGGTCTGC ACCTGCCCAC CGGTGGGATC CTTCGCCGCG
GAGCCGGTCG ACGAGCAGAT TCTCGCCGGA GCCCGCTGA
 
Protein sequence
MPDVTQYDDT PYADGPATAA DGPATTATTA TISPASSHAR SGAPRQGSAA AVGRNGARRL 
PAAAIPRFAD RHIGPDPSSQ REMLDALRVE SLAALTDAAV PASIRDHDLD LPAALSEPAV
LAALRAFGSR NRPVSSMIGL GFHPAVMPGV IQRNVLENPA WYTAYTPYQP EISQGRLEAL
LNFQTMITDL TGLAVAGASL LDEPTAAAEA MQIAFRTAKG SRATFLIDAD TLPQTVSVVA
TRAEALGINV VVADLAAGLT AVGPAQLDAA FGLLLSYPGP AGVLRDVRAV IASARERGIV
VTIAADPLAL TLLRAPGDLG ADIAVGSTQR FGLPLSFGGP HAGYLAVRKG LERSLPGRLV
GVSVDADGAP AYRLTLQTRE QHIRREKATS NICTAQVLPA VLASMYAVYH GPEGLAGIAR
RIHGHAVRLA EGLRAAGVTV VHDAFFDTVL AAVPGRATQV VADALARGVN LRLVDDDHVG
ISCNETTGQA ELEAVRSAFG VGPEATAGIT WTGTGTGTEI RTGAWTGSVA AANDHLVARA
DQADQADQAD QAADEPVALP AELVRTDPYL QHPVFHDHRS ETAMLRYLRR LSDLDLALDR
GMIPLGSCTM KLNATTEMAA VTWPEFADIH PFAPLDQAAG YLAMIQDLER WLAQITGYAG
VSLQPNAGSQ GELAGLLAIR AYHRDHAVPG SVVRNICLIP SSAHGTNAAS AAMAGMRVVV
VSCDDDGNVD LNDLARKARA NADALAALMV TYPSTHGVYE EGIGQACAIV HEAGGLVYVD
GANLNALVGL ARPGQFGADV SHLNLHKTFC IPHGGGGPGV GPVAVVEKLL PYLPNHPLRP
EAGPATGVGP ISGSPWGSAG ILMIPWAYIR MMGADGLRRA TSVAVLNANY IAHRLHPYYP
VLYAGRDGLV AHECILDLRP LTKLTGVTVD DVAKRLIDYG FHAPTMSFPV AGTLMVEPTE
SEDLGEIDRF CDAMISIRAE ADKVGDGIWP RTDNPLHNAP HTAQMVTANE WSHAYPRSVA
AYPVASLRAA KYWPPVRRID GAYGDRNLVC TCPPVGSFAA EPVDEQILAG AR