Gene Franean1_3412 details


Gene Information

Locus tag: Franean1_3412
Symbol:
ID: 5671783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4041045
End bp: 4042106
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 70%
IMG OID: 641242300
Product: transposase IS204/IS1001/IS1096/IS1165 family protein
Protein accession: YP_001507720
Protein GI: 158315212
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
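
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted as a residue. The record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4041045
end_bp = 4042106

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
stop_codon_len = 3
protein_length = (gene_length - stop_codon_len) // 3

print(gene_length)     # 1062
print(protein_length)  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields.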


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.406116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCCAGG TCACAGGCTC AAGCCGACTT CTGGCACCCC ACAGGCTGGC CGAGCATCCC 
GGCGCGCAGG TGATCTGCCG GGACCGGGCC GGTGCCTACG CCGAGGGCGC CCGCACCGGT
GCCCCGGACG CGGTGCAGGT GGCGGACCGC TTCCATCTGT GGAGCAATCT CGCCGGATAC
GTCGAGACGA CGGTCGCCCG GCATCGTTCC TGCCTGGCGC AACCACCGGC CACTGACGAG
GCTGCGGACG AGCCGCGTGC CGACCTTGAT GGCGCGGTGG CCGCAGCGCG GGCAGCGTCC
TTCGAACAGC GGGCGTTCGT GCGACACGCC CGCGAGCGGT ACGCCGCCGT CCAGGAACTC
AAAGCCGCAG GCGTGGGCAT CAAACCGATC GCCGCCCGAC TCGGCCTGGC CCGAGGAACG
GTCCGCAAGT ACTACCGTGC CACCAGCGTC GACGACGTCC TGGCCAAGGC CCGCGACGGC
CGCGGCTCGA TCCTGCGGCC GTGGGAGCCC TACCTCACCG AGCGGGTCAA CGCCGGGATC
ACCAACGGCA GCCAGCTGTT CGGGGAGATC CGCGACCAGG GATACACCGG GAGCAAAGCC
GTGGTCCTGA CCTACCTGCG CCCCCTCCGC GCCGGCGGCA GTACAGCCGC TCCCGCGACG
CGGACGGCGC CGAAGGTCCG CACCGTCACC CGCTGGATCC TCACGCACCC CGACCACCTG
GATGAACAGG ACACCCTCGC ATTGCAGCAG GTCCTCACCC GCTGCCAGGA CCTCCGGAAG
ACCGCCGACC ATGTCACCGC GTTCGCGCAG ATGCTCACCG GCCGCCACGG GGAGCGGCTC
AACGGGTGGA TCGCCGCCGT CGACGCCGAT GACCTGTCCG ATCTTCACCG CTTCACCCGC
GGCCTCCTAC GCGACCACGA CGCCGTTCTC AACGGACTGA CCCTGCCGCA CAGCTCCGGA
CAGGTCGAAG GCACCGTGAA CCGCATCAAA ATGATCAAGC GGCAGATGTA TGGCCGGGCG
AACTTCGACC TGCTCCGCAA ACGAGTTCTC CTCGCGACCT GA
 
Protein sequence
MPQVTGSSRL LAPHRLAEHP GAQVICRDRA GAYAEGARTG APDAVQVADR FHLWSNLAGY 
VETTVARHRS CLAQPPATDE AADEPRADLD GAVAAARAAS FEQRAFVRHA RERYAAVQEL
KAAGVGIKPI AARLGLARGT VRKYYRATSV DDVLAKARDG RGSILRPWEP YLTERVNAGI
TNGSQLFGEI RDQGYTGSKA VVLTYLRPLR AGGSTAAPAT RTAPKVRTVT RWILTHPDHL
DEQDTLALQQ VLTRCQDLRK TADHVTAFAQ MLTGRHGERL NGWIAAVDAD DLSDLHRFTR
GLLRDHDAVL NGLTLPHSSG QVEGTVNRIK MIKRQMYGRA NFDLLRKRVL LAT
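
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG can serve as an alternative start codon. The sketch below shows one way to check the gene sequence against the protein sequence and to compute a GC fraction. It is a minimal illustration, not the pipeline that produced this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table and start codon set are written out from the NCBI definition of table 11 as best understood here.

```python
BASES = "TCAG"
# Amino acids for translation table 11 in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: initiation codons permitted by table 11; any of them is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons still initiate with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "TTGCCCCAGG TCACAGGCTC AAGCCGACTT CTGGCACCCC ACAGGCTGGC CGAGCATCCC"
print(translate(first_line))  # MPQVTGSSRLLAPHRLAEHP
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow comparison with the full protein sequence and the GC content field.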