Gene Franean1_4382 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4382 
Symbol 
ID5672735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5227800 
End bp5229218 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content68% 
IMG OID641243251 
Productphosphoenolpyruvate carboxykinase 
Protein accessionYP_001508668 
Protein GI158316160 
COG category[C] Energy production and conversion 
COG ID[COG1274] Phosphoenolpyruvate carboxykinase (GTP) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTCA CGATCCCGGG CCTACAGCCG ACGCCGACGA CACACCCGGC ACTGTTGGAG 
TGGGTCGCCA CGATCGCCGA CCTCACCCGG CCCGACCGGG TTCACTGGTG CGACGGCAGC
GACGCCGAGT ACGACCAGCT CTGCGCGGAG CTCGTCGACA AGGGCACGTT CCTCCGTCTC
GCCGAGGACA AGCGGCCCGG CAGCTACTAC GCCGCGAGCG ACCCCAGCGA CGTCGCCCGC
GTCGAGGACC GCACCTTCAT CTGCTCGAGG AGCCAGGACG ACGCCGGTCC GACGAACAAC
TGGACCGACC CGGACGAGAT GCGCATCACC CTGCGGGGCC TGTTCGCGGG TTGCATGCGG
GGCCGCACCA TGTACGTCGT CCCGTTCTGC ATGGGATCGC TAGGCTCACC GATCTCCGCA
CTCGGCGTCG AGATCACCGA CTCGGCCTAC GTCGCGGTCT CGATGCGTGT AATGACCCGA
ATGGGCCAAC CGGCACTCGA CCAGCTCGGA CAGGACGGCT TCTTCGTCCC CGCCGTGCAC
AGCGTCGGCG CGCCGCGCCA GCCCGAGCAA CCCGACGTCG CCTGGCCCTG CAACGCCACC
AAGTACATCG TCCACTTCCC CGAGACACGA GAAATCTGGA GCTACGGCTC CGGCTACGGC
GGCAACGCCC TGCTCGGCAA GAAGTACTAC GCGCTACGGA TCGCCTCGGT GATGGCCCGC
GACGACGGCT GGCTCGCCGA GCACATGCTG ATCCTCAAGC TCACCGGACC CGACGGGAAC
ACCCATTACA TCGCGGCCGG CTTTCCGAGC GCCTGCGGCA AAACCAACCT CGCCATGCTC
GTCCCGACCA TCCCCGGCTG GAAGGTCGAG ACCATCGGGG ACGACATCGC CTGGATGCGC
TTCGGAGACG ACGGACGGCT CTACGCCGTC AACCCCGAGG CCGGCTTCTT CGGCGTCGCG
CCGGGCACCG GCCGGACGAC CAACCCCAAC GCCATCGACA CGATCCACAG CAATGCGATC
TTTACGAATG TCGCGCGCAC CGATGACGGA GACGTGTGGT GGGAAGGGCT GACCAAGGAA
CCCCCGGCAC ATCTCATCGA CTGGCAGGGC CGCGACTGGA CACCACAGTC CGCGACGCCG
GCCGCGCATC CCAACGCCCG TTTCACCGCC CCCGCCAGCC AATGCCCGAC GATCGCTGCG
GAATGGGCCG GCCCGGCGGG CGTTCCGATC TCCGTTGACT GTGCCGCCTG GGAACACGAA
AACCGACACG ATCCGTCGAT ACTTCACCGA CCTCGGCCCC CGCATGCCCG ACGCTCTCTG
GGTCGAACTC GCGGCCCTCG CCGACCGGCT GCGCTGACAC CGTCTCCCAC GCCTCACACC
CTCCGTGAGC CAACGGGTTC GGCCGCCGCC ACAGCCTGA
 
Protein sequence
MPVTIPGLQP TPTTHPALLE WVATIADLTR PDRVHWCDGS DAEYDQLCAE LVDKGTFLRL 
AEDKRPGSYY AASDPSDVAR VEDRTFICSR SQDDAGPTNN WTDPDEMRIT LRGLFAGCMR
GRTMYVVPFC MGSLGSPISA LGVEITDSAY VAVSMRVMTR MGQPALDQLG QDGFFVPAVH
SVGAPRQPEQ PDVAWPCNAT KYIVHFPETR EIWSYGSGYG GNALLGKKYY ALRIASVMAR
DDGWLAEHML ILKLTGPDGN THYIAAGFPS ACGKTNLAML VPTIPGWKVE TIGDDIAWMR
FGDDGRLYAV NPEAGFFGVA PGTGRTTNPN AIDTIHSNAI FTNVARTDDG DVWWEGLTKE
PPAHLIDWQG RDWTPQSATP AAHPNARFTA PASQCPTIAA EWAGPAGVPI SVDCAAWEHE
NRHDPSILHR PRPPHARRSL GRTRGPRRPA ALTPSPTPHT LREPTGSAAA TA