Gene Franean1_5060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5060 
Symbol 
ID5673396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6058881 
End bp6060071 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content76% 
IMG OID641243911 
Productprephenate dehydrogenase 
Protein accessionYP_001509326 
Protein GI158316818 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.01744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.042313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCCCG GCGAGCCCCC CACGGCGGCC GGGCGGTTCT CGCCGCCCTA CGTCCGCCCG 
GGGGTCGAGG CGGCCGCAGG GGGCTCGTCC GCGTGGGATC CGGCGCACCT GCCCGAGCTG
CGCCGGGTGG CCGTCGTCGG GTCCGGGCTG ATCGGCACGA GCATCGGGCT GGCGCTGTCC
GGGCGCGGCG TGGAGGTGTT CCTGCGTGAC TCCGACGACG CCCAGGTGAA GCTCGCCGAG
GCGATGGGCG CCGGCCGGCC ATGGCAGGGC GAACGGGTCG ACCACGCGGT GATCGCCACC
CCGCTGCCCA CCGTCGCCGC CGAGCTGCGC GACCTGCAGC GCGGCGGCCT GGCGACGACG
GTCAGCGACG CCGGCAGCGT GAAGACCCGC CCGCTGGTCG AGGCCGTCCA GCTCGGCTGT
GATCTCGGGG CCTGGTGCCC GGCCCATCCG ATCGCCGGGC GGGAGCGGCA CGGGGCGGTG
TCCGCCCGCG CGGACCTGTT CGCCGAGCGG GTGTGGGCGG TCTGCCCGGT GGCCCACACC
GGCGCGGACG CGATCGCGGC GACCGCCGCC CTCGCCCTCG CCTGCGGCGC GACACCGGTG
CGCACCACCC CCGAGCGCCA CGACGCCGCG ATGGCCGTCC TCTCGCACGT TCCGCAGCTG
GTGGCGAGCG TGCTCGCCGG GAGCCTGCTC GGCCTCGACT CGCACGACCT GCCGTTCGCC
GGCCAGGGCT TCCGCGACAC GACCCGCCTC GCCGACAGCG ACCCCGTCCT GTGGGCGTCG
ATCATCGAGG GCAACCGCGG GCCCATCGCC GAGCGCGTGC GCCGGCTGGG GCGGGAGTTC
ACCCACCTCG CGGACGTGCT CGCCGAGGGG ACCCGTGACG AGGTGGTCGA GGCGGTCACG
GCGGCGATCC ACGGCGGGCG GCACGGCCGG TCGCTGCTGC CCCGCAAGGC CGGTGCCCGG
GCACTTCCGT GGGGCTGGGT CGGTGTGGTG CTCGACGACC GTCCCGGCCA GCTCGCGGCG
CTGTTCGCCG TGATCGGCGA GTGGGACGTC AACATCGAGG ACGTCGGGCC GTTCGAGCAC
AGCCTGGACG CCCCCGCCGG CATCGTCGAG ATCGCGGTCG ATCCGGACGG CGCGGACGGA
CTCGTCGAAC GGCTGACGCG GGCCGGATGG ACGGCATATC GGCGCTCGTG A
 
Protein sequence
MSPGEPPTAA GRFSPPYVRP GVEAAAGGSS AWDPAHLPEL RRVAVVGSGL IGTSIGLALS 
GRGVEVFLRD SDDAQVKLAE AMGAGRPWQG ERVDHAVIAT PLPTVAAELR DLQRGGLATT
VSDAGSVKTR PLVEAVQLGC DLGAWCPAHP IAGRERHGAV SARADLFAER VWAVCPVAHT
GADAIAATAA LALACGATPV RTTPERHDAA MAVLSHVPQL VASVLAGSLL GLDSHDLPFA
GQGFRDTTRL ADSDPVLWAS IIEGNRGPIA ERVRRLGREF THLADVLAEG TRDEVVEAVT
AAIHGGRHGR SLLPRKAGAR ALPWGWVGVV LDDRPGQLAA LFAVIGEWDV NIEDVGPFEH
SLDAPAGIVE IAVDPDGADG LVERLTRAGW TAYRRS