Gene Franean1_1374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1374 
Symbol 
ID5669782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1659000 
End bp1661270 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content71% 
IMG OID641240300 
Producthypothetical protein 
Protein accessionYP_001505727 
Protein GI158313219 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGGATG GCCCGCCGCC AGGTCATGAC CGCTGGTCGC GCGACCGGTG GAGTGGACGG 
ATCCGGGTCA CCGCGACCAC GGTCAGCCCC TTGCTGATCG TGGAGGACCG GGACCGGCGC
CCGTCCGCGG TTCCGCAGTT GCGGGTGCCG TTGGCGGCCG ACCCGGCGGA CCCCGCGCTG
CGGCGGGTCG ACATCGCGCC GACCCAGATC AAGGGCATGC TCCGCGCGGC CTTCGAAGCC
GTCACGAACT CCCGCTTCGG AGTGGTCTCC CGCGCGCACC ACCGGCCGTT GCACTACCGG
ACGTCGGCCA GCAGCGGCAG CGGGCTGCGG CCCGTGGTCG TCGCCCCCAC GGCAACCGGG
GTGGCCCTGC ACGTCGGTGG CACTCTCACC GTGCCCGGGA TGGATACCCT GCCCGGCGCG
GTCCTCCCCG CCTGGAGTCA CGACGCCGGC GACCACCAGC GCCCGCCGAC CACCCTGCTC
GCCGGCGCGG CACCCCACCG GCCGCCGCTC GCCAACGGTC AGCACGTCGC CGCCCTGGTC
GAACAGGTCA CGATTCAGGT TCCCGGACCG AACAGAACCA CTCGTGCGAT GGCACGCTGG
CACGTCACCG ACTGCCTGCC GCTCTCCCCC GGAGCCGACC TCGAAGAGGC CATCAGCCGC
CTCACCGCCC ACGGGCAGCG GCGCTGGGCC CAGGTGACCG GCTACCTGCA CATCACCGGC
CCGACCATCA AGGGCAAGCA GTACGAACGC CTGTTCATCG ACACCTGCGT GCACGCGGAC
CCGACGGTCG CCTTCGACCC GCCGGTGACG ATCGACGACA ACGTCGACGC CCTGCTCACC
GCCCTGCAGA ACCTGATCGA CGACCAGCGG GCGGCCCATC TCGTCGCCCG CAAGGACGAG
ATCTGGCAGC GCAGCGATGA CACCAACGGC ACCCACTCAC CCTGGGACTA CCTCGGGCCC
GACCCGGGTG ACACGGCGTG GGCACGCCAC CTGTACGACA CCGCCGATGC CACCAAGCAT
GGCCGGACGC CACCTGCCTG GACCGGCGTC GACCTCTCCC CGGCCCGTCC CGAGGAGCCG
CGGACAGCGT CGCGTTTCAC CTGCTGGGCC GAGTACGACG CGACCGGCGC ACGGCTGCTC
CGCCTACGCC CGGTAATGAT CAGCCGCCAG GGCTACGAGA AAAGCCCCAT CGATCTCCTC
AACCCGGCCT GGCGACTGCG GCCGGCGACC AGCCTGGACG ATCTCTCCCC GGCAGACCGT
GTCTTCGGCT GGGTCGCCCC GACCAGTCGC GACGACGAGC CGGCCACCAC CCGCGGGCAG
TCGCGCCGCG CGCACCGCGG CCAGCTACGG GTGATCAACG TGCTCGGCCC GCCCGCCGAC
ACGGTCGACG TGCAACCCAC CGCGCTGGTC CTGCCGATCC TGTCGACGCC GAAACCGACG
CAGGGACGGT TCTATCTCGG CAAGATCAAA GGAGCCGGCC CGGTCCAACC ACTCGACCAG
GGCACGACCC GCGCCCGTCA CTTCGCCGAC GGCCAGACCT GGCGGGGTCG CAAGGTCTAC
CTCTACCAGC GCCGCGATCT TCCGCAACGT CGCGCACCCA GCGGCGTGCG CTACCCCGAC
CAACCCAGAA CACAGAGCAG CGAACTACAC GAGTGGGTGC GGCCGGGTGT CACCTTCAGC
TTCGAACTCG CCGTCGACAA CCTCTCCGAC GCCGAACTCG GCGCGCTGCT GTGGCTGCTC
GACCCGCATC ACCTGGGCCG GGCCAGCAAT CCGGATGCCA CGCGACCCGA ACCCGCCCGA
CCGGGCCGAA TGCGGCTGGG TCTCGGCAAG CCGCACGGGT TCGGCGTCGT CGAGGTACGC
CTCGACCCCG ACGCGACCCG CCTGGCGACC GGCGCTGACA TCACGGAACG GTTCACCTCC
CTGACCGATA CCCGCCCGAA CAACCCGGGC TGGGCCGGCC TCGCGCAGAA GTTCGAAGCC
CTGGCCGGCC CGGTGCTCGA ACCCGCCATC ACCGCCGTTC GCGTCGCCGC CGCCGGGGTA
TCCGGCCCGG TGCACTATCC GCGGCAGAAT GCCGAGGATG TCGACAAGGG CTACGAATGG
TTCGTCGCGA ACGAACGCGC CGCCGAAGCC CGGGCCAAGC ACGACGCGAA CACGGTGAGA
AGCCGGCCGG CAGCAGGCAA GTCCCGCCCC CAGGAAGACC GCTCCCTACC GCTCCTGCAC
GGCCAGGACC GCCTCTCGCT GGACCCCATC GCGGAGAAGG ACTACCGATG A
 
Protein sequence
MSDGPPPGHD RWSRDRWSGR IRVTATTVSP LLIVEDRDRR PSAVPQLRVP LAADPADPAL 
RRVDIAPTQI KGMLRAAFEA VTNSRFGVVS RAHHRPLHYR TSASSGSGLR PVVVAPTATG
VALHVGGTLT VPGMDTLPGA VLPAWSHDAG DHQRPPTTLL AGAAPHRPPL ANGQHVAALV
EQVTIQVPGP NRTTRAMARW HVTDCLPLSP GADLEEAISR LTAHGQRRWA QVTGYLHITG
PTIKGKQYER LFIDTCVHAD PTVAFDPPVT IDDNVDALLT ALQNLIDDQR AAHLVARKDE
IWQRSDDTNG THSPWDYLGP DPGDTAWARH LYDTADATKH GRTPPAWTGV DLSPARPEEP
RTASRFTCWA EYDATGARLL RLRPVMISRQ GYEKSPIDLL NPAWRLRPAT SLDDLSPADR
VFGWVAPTSR DDEPATTRGQ SRRAHRGQLR VINVLGPPAD TVDVQPTALV LPILSTPKPT
QGRFYLGKIK GAGPVQPLDQ GTTRARHFAD GQTWRGRKVY LYQRRDLPQR RAPSGVRYPD
QPRTQSSELH EWVRPGVTFS FELAVDNLSD AELGALLWLL DPHHLGRASN PDATRPEPAR
PGRMRLGLGK PHGFGVVEVR LDPDATRLAT GADITERFTS LTDTRPNNPG WAGLAQKFEA
LAGPVLEPAI TAVRVAAAGV SGPVHYPRQN AEDVDKGYEW FVANERAAEA RAKHDANTVR
SRPAAGKSRP QEDRSLPLLH GQDRLSLDPI AEKDYR