Gene Franean1_5580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5580 
Symbol 
ID5673908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6760126 
End bp6761490 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content73% 
IMG OID641244434 
Producthypothetical protein 
Protein accessionYP_001509838 
Protein GI158317330 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.647703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCATCT CACCGGTTAC GGAGGCGGCA GGCGGATTGT CCGCCGCCGT CCGCCGCCGG 
CGCACACGGC GCCGGCCTCA CCTGGCGAGC CGTCAACGGA CGCGCCTGTT CACGCGCCTG
CGGACTCGCC TCCGGACCCT GCCGATCGTG CTCGCGGCGC TGCTCGTGGC CGCCGGGTGT
GTCTCGTGCG TGTCCTGGAA CGACTCGTCC GATGCCACGG ACTCGCACGG TTCGGGTGCC
TCGTGCCCGC CCGTGCCGGG GGTCACGGCG GACGAGGTCC GGCTCGGCCT GCTGTTCCCG
AACACCGGCA ACGCGGCGTC GTTGTTCGAC CCGTTCCGGG CCGGGGTCGA CGCCCGGCTC
GGGGTGGCGA ACGCGGCCGG TGGGGTCCAG GGGCGGAGCA TCGAGTACTC GTGGCGCGAC
GACGAATCGC AGCCCAGCGT CAACGAGACC GCGGCCCGCA TGCTCGTCGA CCAGCACCAG
GTGTTCGGGA TCGTCGAGTC CACCACGGCC GCGGCCGGCT CGGCGGAGTT CCTGCACAGC
CGCGGCATCC CGGTCACCGG AACGTCGCTG GAGGCGTCCT GGACCACCTT CGACAACATG
TTCAGCTACT CGAACATGAT CGCGGACGGT GCCTCGGTGT CGACCTGGGG TGACTTCGTC
GCCGAGCGCG GGGGGACGAC GGCGCTGATC GCCGCGTCGA GCTTCTCGGC CGCCTCGGGC
GCCTTCGGCG AGGAGCTGGC CGCCAGCCTG GAGGCGGCCG GGGTGCGGGT CGTCGGCACC
CTCGACGCGA CCGGGCCGAT CGACTTCGCC GACGTCGGCG CGCAGGTGCG TGACAGCGGC
GCCGACACGC TCGTCGGGGC GGTCACGGGG GCCGCGTTCG GCCAGGTCGT GCTGGGCGCC
CGGGGGGCCG GGGCCAACCT GCGGGTGATC CTCTCGCCGT CCGGGTACGA CCAGAGCCTG
CTGGACGTCT TCGGCCGGGT GCTGAGCGGC GTCTACATCT TCGTCGACTA CCAGCCGTTC
GAGCTCGACA CGCCGGGCCA CCGCGCCTTC CTCGACGCGA TGACGCGGTA CGCCCCGTAC
CTGCAGTCGC CGAACAGGCA GGCCGCGCTC TCCGGCTGGA TCTCGACCGA CATGTTCCTG
CGCGGGCTGG CCGAGGCCGG CCGGTGCCCG ACCCGGGAGC GCTTCATCGA GGGGCTGCGC
GCCGTCCGCG ACTACGCCGC CGACGGGCTG CTGCCGGCGC CCATCGACTT CACGGCCTCC
TTCGGTCAGC TCGGCCGGTG CTACACCTTC CTTCAGGTGG CTCCCGACGC GAGCCGCTTC
GACGTGCTCC GGCCGGCACC GCGCTGCGGC CGGCTCGAGC ACTGA
 
Protein sequence
MRISPVTEAA GGLSAAVRRR RTRRRPHLAS RQRTRLFTRL RTRLRTLPIV LAALLVAAGC 
VSCVSWNDSS DATDSHGSGA SCPPVPGVTA DEVRLGLLFP NTGNAASLFD PFRAGVDARL
GVANAAGGVQ GRSIEYSWRD DESQPSVNET AARMLVDQHQ VFGIVESTTA AAGSAEFLHS
RGIPVTGTSL EASWTTFDNM FSYSNMIADG ASVSTWGDFV AERGGTTALI AASSFSAASG
AFGEELAASL EAAGVRVVGT LDATGPIDFA DVGAQVRDSG ADTLVGAVTG AAFGQVVLGA
RGAGANLRVI LSPSGYDQSL LDVFGRVLSG VYIFVDYQPF ELDTPGHRAF LDAMTRYAPY
LQSPNRQAAL SGWISTDMFL RGLAEAGRCP TRERFIEGLR AVRDYAADGL LPAPIDFTAS
FGQLGRCYTF LQVAPDASRF DVLRPAPRCG RLEH