Gene Franean1_3021 details

Gene Information

Locus tag: Franean1_3021
Symbol:
ID: 5671403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3556376
End bp: 3557491
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 73%
IMG OID: 641241923
Product: hypothetical protein
Protein accession: YP_001507343
Protein GI: 158314835
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
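
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3556376
END_BP = 3557491
GENE_LENGTH_BP = 1116
PROTEIN_LENGTH_AA = 371


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3557491 - 3556376 + 1 gives 1116 bp, and 1116 / 3 - 1 gives 371 codons, matching the reported gene and protein lengths.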


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGGCAG GCTTCGCGAC CTGCGCCGAC GGCGCGGTCT CGCTGCTGAC CCGCGTCTAC 
GACGGCGGCG CCGCCGAGGT CTCGCAGGTG GAGGGCGCGT TGCGGGCCCT GCGTGAGCTC
GCCGGGCCGC GCCGGTTCCT GCTCGTCGGC GACAGCAAGC TCGTCTCCTA CACGAACCTG
ACCGCGATCG ACGCGGCCGG CGCCACCTTC GTCGCCCCGG CCCCGCGGAC CATCGTCGGG
CCGGCCGCGC TCGCGGCGCA CGACCCGGCC ACCGCGACGA TCGTCGACTG GGCGCCCCAG
CGCGAGAAGG ACAAGCTGTT CCACCAGCGC GACGTCCACC GGGTCGTCGA GGGATCCACG
ACCCTGCGCG GCCCGAAGGC CACCGACCCG CCGTTCACGA CGCGCACCGT GTACTCCCAC
TCGGCCCGCC GTGCCGCGGC GTCCGCGGCC AGCCGGCAGA AACAGATCGA CAAGGCACGC
GCCGCGCTCG TCGTGCTGCA CCGCAACCTC GGCACCCACT ACTACCGCGA CGAGGCCGCC
GTCCACGCCC GCGTCGAGAA GATCACCAGG GAGTGTCGGG TCGGGGCGTG GCTGCGCACC
CACGTCGACA CCAACCCCGA CACCGGCAAA CCGCTGCTGA CCTGGTACTT CGACGAGGCG
GCCCTGGACC TGGCGGCGAA CGCCGACGGC TGGTTCGCGC TCCTGACAAA CCAGAGCATC
GAGGAGAAGG ACGCCGCCGG GGTCTTCGTC GACTACAAGG GCCAGGAAGC CTCCGAACGG
CGCAACAGCG CGTTCAAGGG CCCCCTCGCG GTCAACCCGT TCTACCTGGA GAACAACCAG
CGGATCCACG GGCTCCTGCA CGTCGTCGGC CTGGCGCTGC TGCTGTTCTC GCTGATCGAG
CGCGAGGCCC GCCGCGCCGC CGGCCCGACC GGGACGGTCG CCGGCCTCTA CGCCCGCCGC
CCGGCCAAGC CCACCAGCCG CCTCATCCTC GAAGCCCTCG CCGGCCTGCG CCTCGTGCCC
GCTCACGACG GCCAGCCCGC CTACATCCCC CGGCCCACCC CGCTCCAACA GCGCGTGCTC
GACCTCCTCG GAGTCGACCC GACCAAACCC CCGTGA
 
Protein sequence
MQAGFATCAD GAVSLLTRVY DGGAAEVSQV EGALRALREL AGPRRFLLVG DSKLVSYTNL 
TAIDAAGATF VAPAPRTIVG PAALAAHDPA TATIVDWAPQ REKDKLFHQR DVHRVVEGST
TLRGPKATDP PFTTRTVYSH SARRAAASAA SRQKQIDKAR AALVVLHRNL GTHYYRDEAA
VHARVEKITR ECRVGAWLRT HVDTNPDTGK PLLTWYFDEA ALDLAANADG WFALLTNQSI
EEKDAAGVFV DYKGQEASER RNSAFKGPLA VNPFYLENNQ RIHGLLHVVG LALLLFSLIE
REARRAAGPT GTVAGLYARR PAKPTSRLIL EALAGLRLVP AHDGQPAYIP RPTPLQQRVL
DLLGVDPTKP P
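
The sequences above are laid out in space-separated blocks of ten characters. The sketch below shows one possible way to join such blocks into a contiguous string and compute a GC fraction, which could then be compared against the reported gene length (1116 bp) and GC content (73%) once the full gene sequence is pasted in. The helper names are hypothetical, and only the first row of the gene sequence is included here for brevity.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence as shown in the record.
FIRST_ROW = "GTGCAGGCAG GCTTCGCGAC CTGCGCCGAC GGCGCGGTCT CGCTGCTGAC CCGCGTCTAC"

if __name__ == "__main__":
    seq = join_blocks(FIRST_ROW)
    print(len(seq))                 # number of bases in this row
    print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

The gene sequence starts with GTG, which is an alternative start codon under translation table 11 and is read as methionine at initiation; this is consistent with the protein sequence beginning with M.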