Gene Franean1_2616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2616 
Symbol 
ID5671010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3096138 
End bp3097856 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content70% 
IMG OID641241532 
Productintegrase family protein 
Protein accessionYP_001506952 
Protein GI158314444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTACGT GCTCGGGTTG GCCCGTCTGC CCGAGCGGAA GGCCGACCTT CTCGCCGTCG 
TCCGCACACT GTGGATCTAC CGGGAGTGTG TTCCGGTCGA GTGCCGACTC GGTGGCTACC
CGTGGGCGGG CACGGCCGCC AGGGATCTGG TGCGCATGCC ACCGGCGGGC CGGGAGAACA
AGACCCCCGC GGATCGCGGC GCCGACGATG GAAGCGCTGC TGGGCTGGGC GCTGACCATG
GTCGAGACGA TCGGGCCGGA CATCCGCGAC GCCTGGAAGG AGCTGGCCCG GCTCGAGGCC
GGCACCCACC CGTCCCAGCG GATCTACGAC GGGCTGAGTG TGCCCGCCCG GCTCGACCTG
TTCGTCCGGC GGGCCGCCGA GAACGGAACC CCGCTCCCCG GACGCAGCAC CGCAGGCAAG
GCCGTCGCCG TCAACGGTGC CCACGTTCTC CGCCTCGTCG GCGTCCCACC GGACAAACGA
TTCGGCCTGC CGCCACAGCA GCGCGCTCTG CTGGAGGAGT CCGGACTCCC GGTGGCGACC
AACACCTACA TCGGTACGAT CACAGGACAC ATCGACCGCG TTCCCTGGCG GGCCGAGCCC
ATCAGCGTCC CGGAGCTACC CACCATGATC AGGATGCTTT ACGCGTCTGC GTTCGTGGTC
ATCTGCTACC TGTCCGGCAT GCGGCCCGGC GAGGTTCTCA ACCTGCCCCA CGGCTGCCGC
GACAGCGACC CCCGCACCGG CGAACTGCTG CTACGCGGCC GGCGCGGCAA AGGCTACGAC
CGCGGCCCAC TGACCCGACA CGCAGAACCC GGCCGACCGT GGGTCGTCGT CACACCGGTC
CACAGCGCCG TCGAGATGCT GGAGAACCTG GCTGATTTCC CGTTCCTGTT CCCGGCAAGC
CCGATCATGG CCCACGCCCA TCGGGCGAAC ACGACCCACG CCCGCTCCAC CGGGGCGATC
AACCGGGATC TGGAAGACCT CACCACCTGG ATCAACACCA CGTTCGCCCG CCCCGACGGC
GACGCGCCGA TCCCACCGGA CCCGACCAAG CACCTGCACG CAGCCCGGTT CCGGCGGACG
CTGGCCTATT TCATCGTGCG CCGGCCCCGT GGCCTCGTCG CGGCGGCGCT CCAGTACGGA
CATCTCCATA CCAAGGTCAC CCTCAACTAC GCCGGTGACG CGGACACGTC CTGGCTCGAC
GATCTCCCCG TCGAACGCCT GGAGATGATC CTCGAACAGG TCGACACAGA TGCCCGGCTC
CTCGAGAACG ACGAGCACGT CAGCGGCCCC GCCGCCGCCG ACTACCACGC CCGCATCACC
CGGGCAGCAC GGTTCTCCGG CCGCGTCGTC AACCAGACCC GCAACGCCCA GCGGCTCCTC
GCCAGCCTCG ACCCGGACAT CCACCACGGC GACGGCATCA CCTGCGTCTA CCGCGCCGAA
ACAGCCGAAT GCCGGCGCAT CCTCGCCAGC CAGGGACTCG CCGCCGACAG CCCGCGAGAA
TCCGAGTGCC GATCCTCCTG CACCAACCTC GCTTTCACCG ACCGGGCGGT TGACCAGCTC
CATGCCCGGC TCACCCACCT CGACGCCACC GCCGATCACT CCCTGACGCC GCAACCGCTC
CGCGACCGCG CCCAAGCCCA GGCGAACGCC ACGAGAGCCG TCATCGACCG ACACGTCGCG
TCGTCCAGCC ACCTGACAGA ACCGGCAGGA CAACGATGA
 
Protein sequence
MPTCSGWPVC PSGRPTFSPS SAHCGSTGSV FRSSADSVAT RGRARPPGIW CACHRRAGRT 
RPPRIAAPTM EALLGWALTM VETIGPDIRD AWKELARLEA GTHPSQRIYD GLSVPARLDL
FVRRAAENGT PLPGRSTAGK AVAVNGAHVL RLVGVPPDKR FGLPPQQRAL LEESGLPVAT
NTYIGTITGH IDRVPWRAEP ISVPELPTMI RMLYASAFVV ICYLSGMRPG EVLNLPHGCR
DSDPRTGELL LRGRRGKGYD RGPLTRHAEP GRPWVVVTPV HSAVEMLENL ADFPFLFPAS
PIMAHAHRAN TTHARSTGAI NRDLEDLTTW INTTFARPDG DAPIPPDPTK HLHAARFRRT
LAYFIVRRPR GLVAAALQYG HLHTKVTLNY AGDADTSWLD DLPVERLEMI LEQVDTDARL
LENDEHVSGP AAADYHARIT RAARFSGRVV NQTRNAQRLL ASLDPDIHHG DGITCVYRAE
TAECRRILAS QGLAADSPRE SECRSSCTNL AFTDRAVDQL HARLTHLDAT ADHSLTPQPL
RDRAQAQANA TRAVIDRHVA SSSHLTEPAG QR