Gene Francci3_3897 details

Gene Information

Locus tag: Francci3_3897
Symbol:
ID: 3906665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4661105
End bp: 4662934
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 68%
IMG OID: 637881223
Product: phosphoenolpyruvate carboxykinase
Protein accession: YP_482976
Protein GI: 86742576
COG category: [C] Energy production and conversion
COG ID: [COG1274] Phosphoenolpyruvate carboxykinase (GTP)
TIGRFAM ID:
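
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 4661105
end_bp = 4662934

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1830 bp) and protein length (609 aa).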


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.627986
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAACCG CGGCACAGGT TTCGGGGCTG GACGTGGCCC CCACGAGGCA CGCCAGGCTG 
GTTGCCTGGG TGCGCGAGAT TGCCGAACTC ACTCAGCCGG ATCGGGTTGA GTGGTGCGAT
GGATCAGAGG CGGAGTTCGA TCGGCTGACC TCCCTTCTGA TCGAGCAGGG GACCCTGGTC
CGGCTGAATG ACGAGAAACG TCCGAACAGC TTTTACGCCG CTTCGGACCC GGGCGACGTG
GCCCGCGTGG AGGACCGCAC CTACATCTGC TCCGAGCAGG CGGAGGACGC CGGTCCGACC
AACAACTGGC TGGACCCTGC CGAGATGCGG GCCACGCTCC GGGGCCTGTT CGCGGGGTGC
ATGCGGGGTC GGACGATGTA CGTCGTCCCG TTCTGCATGG GTCCGCTCGG CTCGCGCATC
TCGGCCCTCG GTGTCGAAAT CACCGACTCG CCTTACGTCG TCATCTCCAT GCGGACGATG
ACCCGGATGG GAACGCCCGC CCTCGAGCAG CTTGGAACGG ACGGTTTCTT TGTCCCGGCC
GTCCATTCCC TGGGCGCTCC GCTGGAACCC GGCCAGGCCG ATGTTCCGTG GCCGTGCAAT
ACGACGAAGT ACATTACCCA TTTCCCGGAA ACCCGGGAGA TCTGGTCCTA CGGATCGGGT
TACGGCGGAA ACGCACTGCT CGGCAAGAAG TGCTATGCGC TGCGTATTGC GTCCGTCATG
GCCCGCGACG AGGGCTGGCT CGCCGAGCAC ATGCTCATCC TCAAACTGAC GTCTCCCGCC
GGCAAGGTCC ACTACATTGC CGCCGCATTT CCGTCGGCCT GCGGAAAGAC CAATCTGGCG
ATGCTCATCC CAACCCTGCC GGGCTGGCAG GCGGAGACTG TCGGTGACGA CATCGCCTGG
ATGCGCTTCG GTGAGGACGG CCGGCTGTAC GCGGTCAACC CGGAGGCCGG GTTCTTCGGG
GTCGCCCCGG GGACCGGCGA GCAGACCAAC CCCAACGCGG TCAGGACCCT CTGGGGCAAC
GCCATCTACA CCAACGTGGC CAGGACCGAT AACGGTGACG TCTGGTGGGA GGGGCTGACC
AAGCAGCCGC CGGCGCACCT CATCGACTGG AAGGGCCGCG ACTGGACCCC CGAGTCCAGC
GAGCCCGCCG CACACCCCAA CGCCCGCTTC ACCGTCCCGG CGGGCCAGTG CCCGACGATC
GCCCCGGAGT GGGAGGATCC GCGCGGCGTG CCGATCTCGG CGATCCTGTT CGGCGGGCGC
CGGGCGACCG CGGTGCCGCT GGTCACCGAG GCTCCCGACT GGCGGCGCGG GGTGTTCTTC
GGCTCGATCG TCGCCTCCGA GACGACGGCG GCCCAGGCCG GAGCGATCGG CAAGCTGCGC
CGGGACCCCT TCGCGATGCT GCCGTTCTGC GGCTACAACA TGGCCGACTA CTTCGCCCAC
TGGCTCGAGG TCGGGCAGAA GGCCGACCAG ACGAAGCTCC CGCGGGTCTA CTACGTGAAC
TGGTTCCGCA AGAGCCCGGA GGGCAGGTTC CTGTGGCCGG GATTCGGCGA CAACAGCCGT
GTCCTCGCCT GGATCGTCGG TCGCCTGGAG GGCACGGCAG CCGGGGTGGA GACGCCGCTC
GGCGTCCTGC CGACGAAGGA CGCCCTACCC GTTGACGGCA TCGACATCGC CGAGGAGGAC
CTCGAGACCC TGCTCACGGT GGACGTCGAG GTGTGGAAAC AGGAGGCCCA GCTGATCCCC
GAGCACTACC AGACGTTCGG GGAACGGCTT CCGGCCGCCC TGTGGACCGA GCACGAGGCG
CTTGTCGAGC GGCTGAACAG CGCAGACTGA
 
Protein sequence
MTTAAQVSGL DVAPTRHARL VAWVREIAEL TQPDRVEWCD GSEAEFDRLT SLLIEQGTLV 
RLNDEKRPNS FYAASDPGDV ARVEDRTYIC SEQAEDAGPT NNWLDPAEMR ATLRGLFAGC
MRGRTMYVVP FCMGPLGSRI SALGVEITDS PYVVISMRTM TRMGTPALEQ LGTDGFFVPA
VHSLGAPLEP GQADVPWPCN TTKYITHFPE TREIWSYGSG YGGNALLGKK CYALRIASVM
ARDEGWLAEH MLILKLTSPA GKVHYIAAAF PSACGKTNLA MLIPTLPGWQ AETVGDDIAW
MRFGEDGRLY AVNPEAGFFG VAPGTGEQTN PNAVRTLWGN AIYTNVARTD NGDVWWEGLT
KQPPAHLIDW KGRDWTPESS EPAAHPNARF TVPAGQCPTI APEWEDPRGV PISAILFGGR
RATAVPLVTE APDWRRGVFF GSIVASETTA AQAGAIGKLR RDPFAMLPFC GYNMADYFAH
WLEVGQKADQ TKLPRVYYVN WFRKSPEGRF LWPGFGDNSR VLAWIVGRLE GTAAGVETPL
GVLPTKDALP VDGIDIAEED LETLLTVDVE VWKQEAQLIP EHYQTFGERL PAALWTEHEA
LVERLNSAD
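
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the pipeline that produced this record. It assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code for internal codons), and it simplifies start handling by always reading the first codon as methionine, which fits the GTG start seen here. The function name `translate_cds` and the table construction are illustrative.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino-acid
# assignments as the standard genetic code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Simplifying assumption: the first codon is a valid start codon and is
    always translated as methionine (M), as with the GTG start here.
    """
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = "M" if index == 0 else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "GTGACAACCG CGGCACAGGT TTCGGGGCTG GACGTGGCCC CCACGAGGCA CGCCAGGCTG"
print(translate_cds(first_line))  # MTTAAQVSGLDVAPTRHARL
```

The output matches the first 20 residues of the protein sequence listed above (MTTAAQVSGL DVAPTRHARL).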