Gene Franean1_6615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6615 
Symbol 
ID5674930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8049714 
End bp8051399 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content70% 
IMG OID641245466 
Productplasmid pRiA4b ORF-3 family protein 
Protein accessionYP_001510858 
Protein GI158318350 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCG TGAGCCGCGG TCGCAACGAC AGGAAGAAGC CCTCTTCCAA GGGCCCGGCG 
CCAGCCAGAC GCGCGTCGCG GCCCGCGCCG TCCTCCGTGC CGGAGATCCG GAACCAGTCT
CCGCGGCCTC ATGACGACCG CGAGGCGGAG CTGCGCGCGT TCACCACCGA CATGCTCAGC
ATGGGCGGGG TACTCCGCGA GAGCGACGAC CCGCTGATGG CGGAGGTGGT CGGTGCGACG
TTCGCCGTGC TCGATGACCT CACCGATGGC TCGGTCGGGG AGGCATTCCT TGAGGAGATC
GTGCCGCGGC TGGAGACCGC GGCGACCGGG GACGCTCTTG CGGTGCTGCT GGCGATAGGT
GCCGTCGTGC CAGGGCCGGT CAGTGCCACG GCGGATGCCG CCGCCGGCCG GCTGACCGCG
GCCGGGGTCT CCCTGCCGCA GTGGGCGGAC GAGCTTACGG TGCCGCCGCT GGCCGGCGAC
TTCCAGCGCC GGTGCGACGA CGAAGGTATC GCCGTCTTCC TGAGCTGCAC GGTCGAGCGG
GCTGGTCGGC GCCATGCGTT CCTGGTGTGC GTCGACCCAT TGGCCTGCGG TGAGGCCGGA
GACCTCCTCG TCCTCCCGGC CGAGGATCTT CCGAGGGTTC TGGAGGGGGT CACCGCGGAG
GCTCGCAAGA ACAAGGTCAG GCTCCGAACC GAGACGCTAG ACGCCGAGGA GTTCCGCTGG
CAGGTCGAGA ACGCCCTGAA TGCGCGGGCG GTGCACGACG ACGAGCTCTC GTTCGACGGG
GACGACGGGG AAGCTGCCTC CTTTCTCCGG TTCGGCGACG AGGACGAGAT CCCCTTCGGG
GAGGAGGACG GCGAGGGCGG GCCGCCGTAC GAGGCCGTGG CGCTGGTGCT GCGGTCGCGC
CTCGCGACGC TACCGACCCC GCGGAGGCCA CCCGCGCCGC ACGCGGATGA CGACGGGGAC
GAGGGAATCG ACGCGCTCGG CGGTATGGCC GACCTCGTAA AGATGCTGAA CGAGCGGGGG
GCGAACCCCG CCGAGCTCGC CCTGGCGAAC CTGACCGGAT GGCGCGGCCT GCCGGCGACG
GGCCGTCCGC CGACGCCACC GCTCCCAAAG AAGCCGAGGC GGGGCAAGGG GCAGCAGGCC
CCGGTCTACC AGGTCAAGGT CGGGCTGCGC GGCACGAAAC CACCGATCTG GCGACGGCTG
GAGGTGCCAG CCGACACCAA CCTGGCACGC CTGCACACGA TCATCCAACT GGCGTTCGAC
TGGGAAGACA GCCATCTGCA CGTCTTCGAG ACGCCTTACG GCAGATTTGG CACTCCGGAC
GTGGATCTTG GCCACCGCGA CGAGAAGTCG GTGTCGCTGG AGCAGGTGCT TCCCGACGTC
AAAGCAAAGA TTAGCTATAC CTACGATTTC GGCGACTCCT GGGAACACGA GATCGCTCTG
GAGAAGATCC TCGAACGCAG CCCATCTGTC CGGTATCCGC GCTGCACCGG CGGGCGTCGC
GCGGCCCCGC CGGAGGACTG CGGCGGTATC TGGGGCTACG AGGCGCTCCT GCAGATCCTG
GACGATCCCA GTCATCCCGA GCACCACGAG CGGCTGGAAT GGCTGGGCCT CGACGACCCC
GCCGACCTCG ACCCCACCGA GTTCTACGGA GCCGGAGTGA CGGCCGCACT GTCCCGGCTC
CGCTGA
 
Protein sequence
MSPVSRGRND RKKPSSKGPA PARRASRPAP SSVPEIRNQS PRPHDDREAE LRAFTTDMLS 
MGGVLRESDD PLMAEVVGAT FAVLDDLTDG SVGEAFLEEI VPRLETAATG DALAVLLAIG
AVVPGPVSAT ADAAAGRLTA AGVSLPQWAD ELTVPPLAGD FQRRCDDEGI AVFLSCTVER
AGRRHAFLVC VDPLACGEAG DLLVLPAEDL PRVLEGVTAE ARKNKVRLRT ETLDAEEFRW
QVENALNARA VHDDELSFDG DDGEAASFLR FGDEDEIPFG EEDGEGGPPY EAVALVLRSR
LATLPTPRRP PAPHADDDGD EGIDALGGMA DLVKMLNERG ANPAELALAN LTGWRGLPAT
GRPPTPPLPK KPRRGKGQQA PVYQVKVGLR GTKPPIWRRL EVPADTNLAR LHTIIQLAFD
WEDSHLHVFE TPYGRFGTPD VDLGHRDEKS VSLEQVLPDV KAKISYTYDF GDSWEHEIAL
EKILERSPSV RYPRCTGGRR AAPPEDCGGI WGYEALLQIL DDPSHPEHHE RLEWLGLDDP
ADLDPTEFYG AGVTAALSRL R