Gene Franean1_3065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3065 
Symbol 
ID5671444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3619848 
End bp3621182 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID641241963 
Producttransposase 
Protein accessionYP_001507383 
Protein GI158314875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCA TGGGGGATGT TCTGGACCAG CTGTCTCGTC GGTTCGCGGT GGTGCTGCCG 
CACCTCGACG AGCGGCAGCG GCGACTGGTG CTGGCAACCG AAGCCCGGCT GCTGGGGCAT
GGCGGTATCC GCGCGGTGGC CCGAGTCGCC GGTGTCAGCG AGACCACGGT CCGCGTCGGC
GTGTTCGAGC TCGAAGCGGG CGGGGAACCC CTGCCCGATC GACGAGTCCG CCGGCCAGGT
GGGGGCCGTA AACGCATCGA GGACACCGAC CCGGCCGTGG TGACAGCGCT CCTCGCACTC
GTCGAGCCGG ACGAGCGAGG TGATCCGACC TCACCGCTGC GGTGGACCAC GAAGTCGCTA
CGACACCTCG CCGAACAGCT CACCCGCCAA GGGCACCCGG TATCGCCGTC GACAGTCCGC
CGGCTCTTGC AGGCGGCTGG TTTCAGCCTG CAGGCGAACT CCAAAACCCT GGAAGGAAAG
CAGCACCCCG ACCGGGACGC CCAGTTCCGC TACCTGAACA ACCAGGTCAT GGAACATCAG
AAAGTCGGCG AGCCGGTCAT CAGCGTGGAC GCGAAAAAGA AGGAGATGCT CGGCCAGCTC
CCGAACCCGG GCCGTGAATG GCGACCGAAA GGCGACCCTG TCCAGGTCGA GGATCACAGC
TTCTTCACCG GCCCGCAGGG CGACACCGCC ATCCCCTACG GCGTCTACGA CCTGACCACC
GACGCCGGCT GGGTCAACGT CGGGGTCGAC CACGACACCT CAGCGTTCGC GGTGGCCTCG
ATCCGCCGCT GGTGGCAGGC CCGCGGCCAG GCCGACTACC CCCAGGCCAC CCGGCTGCTG
GTCACCGCGG ACGCGGGCGG GTCGAATAGC TACCGCTTTC GAGCTTGGAA AGCCGAACTC
GCCGCGCTCG CCGCCGACAC CGGCCTGACG ATCACCGTGT GTCATTTTCC GCCCGGCACG
TCGAAATGGA GTCGCGGTAG GGACCGCCCT TGCGGGCGGC CCCCCGCACA GATCCCAGCG
TGCGGGACTA CCGCACTGGG CTCCTGCCTC AGGTTCTGGC TGCGAAGCGT CTCTCCGGGA
AGGGATGCAT CACTCGGACT GGGGGTAGCC ATCGAGCCGC GATCCGCCCC ATCCGTTGCC
AGTTCATCCG GTCACGTTGG CTGCGGCGCC GCAGCGCCTT GCACCAGTGC CGTGTCACCT
GGGTACGGAA GGCCGACATC GTGTCGGTGT TGCCGGGCAC GGCGTAGTAG GCCATATGCC
CACGTAGCAC GCTCGCCAAC CAGCGTCCCT GATCCGGGAT GGGCGTATGC CGGCGACGCT
TCAGCTGCTC ATTGA
 
Protein sequence
MAIMGDVLDQ LSRRFAVVLP HLDERQRRLV LATEARLLGH GGIRAVARVA GVSETTVRVG 
VFELEAGGEP LPDRRVRRPG GGRKRIEDTD PAVVTALLAL VEPDERGDPT SPLRWTTKSL
RHLAEQLTRQ GHPVSPSTVR RLLQAAGFSL QANSKTLEGK QHPDRDAQFR YLNNQVMEHQ
KVGEPVISVD AKKKEMLGQL PNPGREWRPK GDPVQVEDHS FFTGPQGDTA IPYGVYDLTT
DAGWVNVGVD HDTSAFAVAS IRRWWQARGQ ADYPQATRLL VTADAGGSNS YRFRAWKAEL
AALAADTGLT ITVCHFPPGT SKWSRGRDRP CGRPPAQIPA CGTTALGSCL RFWLRSVSPG
RDASLGLGVA IEPRSAPSVA SSSGHVGCGA AAPCTSAVSP GYGRPTSCRC CRARRSRPYA
HVARSPTSVP DPGWAYAGDA SAAH