Gene Francci3_1620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1620 
Symbol 
ID3905899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1943946 
End bp1945151 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content67% 
IMG OID637878958 
Productextracellular ligand-binding receptor 
Protein accessionYP_480725 
Protein GI86740325 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.368561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.738285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGTC CTGGACGGCG AGGGGGCGTG CCCCGACCGA GCGTGACGCG GTGGGGGGCC 
CTCGGATCCG TGGGTGTGCT GACAGCCGCG GCCGTCCTTG CCGGCTGCGG CGGTGGTTCG
TCCGGTGACG ACGGGAAAAA GGAATACGCC ATCGGTTTTC AGGGGCCGCT CTCCGGCGAC
AACCAGCAGC TCGGCATCAA CGCCTACGAC GGGGTGCTGA CCGCTGTCGA GCTAGCGAAC
CGGCGCAAGG ATCTGCCGTT CAGGCTGCGC CTGGTCGCCT CGGACGATCA GGGTATGGCC
GAGCAGGGGC CCACCGCCGC GCAGAAACTG ATCGACAATC CGGAGGTCAT CGGCGTCGTC
GGCCCCGTCT TCTCCGGACC GACGAAGTCG AGTGAGCCGC TCTACAGCGG GGCCGGGCTG
CTGTCGGTCA GCCCGTCGGC CACCAATCCG GCGCTCACCG ATCTCGGGTT CACCAGCTTC
TACCGGGTCA TCGCACCGGA CACCGTGCAG GGATCCGCCG CCGCGGAATA CCTTGCCAAG
GTCGTGAAGG CGGACAAGGT CTACTCTCTC GACGACCGGA GCGAATACGG CACCGGCTTG
TCCGGAGCGC TCGAGAAGGC CCTGACCGGC CGTGGCATCC GCGTGATCCA CGACGGCATC
AATCCGACGA AGGACTACAC GTCCCAGGCC ACGAAGATCC TCGCCGAGAA TCCGGACGCC
GTGTACTATT CTGGCTACTA TGCGGAACTC GCGTTGCTGA CCAGGGCGCT GCGCAGCAAG
GGGTACACCG GGAAGGTCGT CAGCGGCGAC GGCGCGAACG ACGACCAACT CATCCACCAG
GCCGGTGCCG GCAACGCCGA GGGAACGCTG CTGACCTGCC CCTGCGGTGA CCCGAACAGC
GATCCCGCGG CGGCGGGGTT CGTCGCCGAC TACAAGACGA TCAACGCCGA CGCGCGGCCT
GGAACCTATT CCGGCGAGGC TTATGACGCC ACGAACGCCG TCATCGAGGT GCTGCGCCGG
CTCGGTAGCG GCGCGACGCG GGAGGCCGTG CTCGCCCGGT TCGGCTCGGT CGACATTCCT
GGCGTCACCA AGCGCATCAG ATTCCGGAAG AATGGTGAGG TCGAGGGCTC GACGGTCTAC
GTGTACGAGG TCCGGGCCGG GAAACGGGCC GTGCTCGGCC CGGTCAGCTC CCTCGTCAGA
CCGTAA
 
Protein sequence
MTGPGRRGGV PRPSVTRWGA LGSVGVLTAA AVLAGCGGGS SGDDGKKEYA IGFQGPLSGD 
NQQLGINAYD GVLTAVELAN RRKDLPFRLR LVASDDQGMA EQGPTAAQKL IDNPEVIGVV
GPVFSGPTKS SEPLYSGAGL LSVSPSATNP ALTDLGFTSF YRVIAPDTVQ GSAAAEYLAK
VVKADKVYSL DDRSEYGTGL SGALEKALTG RGIRVIHDGI NPTKDYTSQA TKILAENPDA
VYYSGYYAEL ALLTRALRSK GYTGKVVSGD GANDDQLIHQ AGAGNAEGTL LTCPCGDPNS
DPAAAGFVAD YKTINADARP GTYSGEAYDA TNAVIEVLRR LGSGATREAV LARFGSVDIP
GVTKRIRFRK NGEVEGSTVY VYEVRAGKRA VLGPVSSLVR P