Gene Franean1_1285 details


Gene Information

Locus tag: Franean1_1285
Symbol:
ID: 5669698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1550818
End bp: 1552065
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 72%
IMG OID: 641240217
Product: hypothetical protein
Protein accession: YP_001505645
Protein GI: 158313137
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
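
The coordinate and length fields above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1550818
END_BP = 1552065


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length):
    """Residue count for a CDS whose last codon is a stop codon."""
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1248 415
```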


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.193911
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.90193
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGACGAC GTCTGGGTCT GCGGCTCGGG GCCGCCTTCG CCGGGGCCGC GCTGGCGCTC 
GCCGCGTGCG GCGGCGGCGA CGGCGAGCAG GGCGGTGGCA CGGCCACCAG CTCCGGCGCC
CCGGTCAAGC TGATGATCAT CGCGCCGGTC GGCACCACCG GGGCCAACCA TCCCGAGATG
GTGGCGGCGG TCCGGGCCGC CGCCCGCGGT GTCAACGAGC GTGGCGGCAT CAAGGGCCAT
CCGGTCGAGA TCCTGCACTG CAACGAGAAG AACGACCCCA CCGCGGCGAA GGAATGCGCC
CAGAAGGCGG TGGACGAGCA CGTTCTGGCC GTGGTCTCCA CCGTCAACGG CTCGGGCGGG
ATCATGCCGA TCCTCGAGGA GGCCGGCATC CCGGCGATCG GGTCGGCCGG GATCGCGGCG
GACGGCTCCG AGCTCAGCTC GGACGTCAGC TTCGTCGTCA GCCCGCTCAC CTTCTACCCG
GCCGTCTGCC CGTCGCTGCT ACGCAAGGCC GGGGCGTCCA AGATCGGGCT GGTCGGCTAC
GACCTGAGCG CGAGTGACCG CCTGATCACG ATGGCCCAGG CCGGCGGGCG CGCGGCCGGG
GCGCCGATCA ACCCCGAGTT GCGCATCCCG ATCACCAGCA GCGACCTCAC CCCGACCGTC
GCACAGCTGA GCAGGGCGGG TGCGGACGGC GCCGTCCTGG TGGTGTTCGA CCAGGCCGCC
TACGCGGTCA TCGGCGGCGG CGACCCGAAC CTGCGCACCT GCCACGCGGC CGGCACCCTC
TCCAAGGAGT ACCTCGCCAC GCTCGGGCCG GCCGCCGACA ACCTCGTCGT CGCCAGCGCG
TTCCCCGAGC TCAGCCAGGC CGCCGAGTTC CCCGAACTCA AGCGGATGAT CTCCGAAATG
GACGCCGAGG CGGCTGGGGG CGACGCCGAC GCGCGCGCCG ACCTCCGGGA TTCCACGGAG
ACCACCGGGG CGTGGCTGTC CGTCCAGATC GCCGAGAAGG TCGGCAACTC CGTCTCGGGC
GACCTGACGA CGAAGAGCCT GCTCGAGCAG CTCCGCGCGA CCAAGGGCCT CGACCTCGGC
GTGATCCCGC CGCTGGACTT CACCACACCC AACCCCATCC CGGGCGTGGA GCGCGTCTTC
AACACGACGA TGCGCGGTGC CCGCTGGAAC AGCGCCCAGC ACACCTTCGT CCCGCTCGGG
CCGGAGACCT ACGAGGCGCT CGGCCTGCTG ACCCGCGGCG CTTCCTGA
 
Protein sequence
MRRRLGLRLG AAFAGAALAL AACGGGDGEQ GGGTATSSGA PVKLMIIAPV GTTGANHPEM 
VAAVRAAARG VNERGGIKGH PVEILHCNEK NDPTAAKECA QKAVDEHVLA VVSTVNGSGG
IMPILEEAGI PAIGSAGIAA DGSELSSDVS FVVSPLTFYP AVCPSLLRKA GASKIGLVGY
DLSASDRLIT MAQAGGRAAG APINPELRIP ITSSDLTPTV AQLSRAGADG AVLVVFDQAA
YAVIGGGDPN LRTCHAAGTL SKEYLATLGP AADNLVVASA FPELSQAAEF PELKRMISEM
DAEAAGGDAD ARADLRDSTE TTGAWLSVQI AEKVGNSVSG DLTTKSLLEQ LRATKGLDLG
VIPPLDFTTP NPIPGVERVF NTTMRGARWN SAQHTFVPLG PETYEALGLL TRGAS
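
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) allows for an alternative start codon. The sketch below shows one way to reproduce the protein from the gene sequence and to compute a GC fraction. It is an illustration only, not the method used to build this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 bases of the gene are used in the usage line.

```python
# Amino acids of NCBI translation table 11, indexed by codon in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for table 11; each is read as Met at position 0.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = "GTGAGACGAC GTCTGGGTCT GCGGCTCGGG"
print(translate(first_line))  # prints: MRRRLGLRLG
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared against the Protein sequence and GC content fields of this record.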